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Abstract 

The Ghirardi-Rimini" Weber (GRW) theory of spontaneous wave function col- 
lapse is known to provide a quantum theory without observers, in fact two different 
ones by using either the matter density ontology (GRWm) or the flash ontology 
(GRWf). Both theories are known to make predictions different from those of 
quantum mechanics, but the difference is so small that no decisive experiment can 
as yet be performed. While some testable deviations from quantum mechanics 
have long been known, we provide here something that has until now been miss- 
ing: a formalism that succinctly summarizes the empirical predictions of GRWm 
and GRWf. We call it the GRW formalism. Its structure is similar to that of the 
quantum formalism but involves different operators. In other words, we establish 
the validity of a general algorithm for directly computing the testable predictions 
of GRWm and GRWf. We further show that some well-defined quantities cannot 
be measured in a GRWm or GRWf world. 

PACS: 03.65. Ta. Keywords: quantum theory without observers; Ghirardi-Rimini- 
Weber (GRW) theory of spontaneous wave function collapse; empirical predic- 
tions; quantum measurement theory; predicted deviations from quantum mechan- 
ics; primitive ontology; limits on knowledge; positive-operator-valued measure 
(POVM); completely positive superoperator. 



*Departments of Mathematics, Physics and Philosophy, Hill Center, Rutgers University, 110 Frel- 
inghuysen Road, Piscataway, NJ 08854-8019, USA. E-mail: oldstein@math.rutgers.edu 

^Department of Mathematics, Hill Center, Rutgers University, 110 Frelinghuysen Road, Piscataway, 
NJ 08854-8019, USA. E-mail: tumulka@math.rutgers.edu 

■'■Dipartimento di Fisica dell'Universita di Genova and INFN sezione di Genova, Via Dodecanese 33, 
16146 Genova, Italy. E-mail: zanghi@ge.infn.it 



1 



Contents 



1 Introduction 3 

1.1 A First Look at tlie GRW Formalism 4 

1.2 Role of the Primitive Ontology 7 

1.3 Status of the Derivation [7 

2 The GRWm and GRWf Theories 

2.1 The GRW Jump Process in Hilbert Space 

2.2 GRWm 

2.3 GRWf 

2.4 Empirical Equivalence 

2.5 Systems 

3 Mathematical Tools 

3.1 POVM 

3.2 The Distribution of the Flashes 

3.3 The Conditional Probability Formula 

4 How Operators Emerge 

5 The Quantum Formalism 

5.1 Mathematical Tool: Completely Positive Superoperators 

5.2 The Formalism 

5.3 First Example 

5.4 Consistency Between Superoperators and POVM 

5.5 Another Example 

5.6 The Law of Operators 

6 The GRW Formalism 

6.1 The Formalism 

6.2 Isolated System 

6.3 Density Matrix 

6.4 Smallness of Deviations 

7 Derivation of the GRW Formalism 

7.1 Density Matrix 

7.1.1 Statistical Density Matrix 

7.1.2 The Marginal Probability Formula 

7.1.3 The Marginal Master Equation 

7.1.4 State and Primitive Ontology 

7.2 Derivation of the GRW Formalism 

7.3 An Example: Consecutive Experiments 

7.3.1 Conditional Density Matrix 

7.3.2 Conditional Distribution of Pointer Positions 



2 



8 Random Run-Time |40 

8.1 GRW Formalism for Random Run-Time 4a 

8.2 Derivation ii 

8.3 Quantum Formalism for Random Run- Time is] 

9 Genuine Measurements |49 

9.1 Limits on Knowledge 49 

9.2 If Information About the Initial Wave Function Is Given 51 

9.3 The Quadratic Functional Argument |52 

10 Conclusions [53 
A Proof of the Conditional Probability Formula |54 
B Check of Consistency Conditions [55 
C Proof of the Marginal Probability Formula [56 
D Proof of the Marginal Master Equation |57 



1 Introduction 

This paper is about the derivation of statistical predictions for macroscopic behavior 
from a specific microscopic physical model. That is common in statistical physics. A bit 
unusual, though, is that the microscopic model we study was developed for explaining 
quantum mechanics. Indeed, in order to obtain a quantum theory without observers, and 
thus to solve the measurement problem and other paradoxes of quantum mechanics, 
it has been suggested that one should incorporate spontaneous collapses of the wave 
function into the laws of nature by replacing the Schrodinger evolution with a stochastic 
and nonlinear evolution law. The simplest and best known proposal for such a law is due 
to Ghirardi, Rimini, and Weber (GRW) [2Sl[n] (see [B] for a review of collapse theories). 
This is the framework we are concerned with in this paper. Our goal is to obtain the 
axioms of quantum mechanics as theorems in the GRW theory. 

To complete the GRW theory, one needs to specify a choice of primitive ontology 
(PO) and a law determining how the wave function governs the PO (see [3J for a discus- 
sion). Two possibilities for the PO and its law have been proposed: the matter density 
ontology and the flash ontology, leading to two different theories we shall denote GRWm 
and GRWf, respectively, in the following. We recall their definitions in Section [2l It is 
known that GRWm and GRWf are empirically equivalent, i.e., that they make exactly 
and always the same empirical predictions [3j; we describe the reasons in Section [2.41 
The first purpose of this paper is to derive what these predictions actually are. By 
"empirical predictions" we mean those predictions that can be tested in experiment; 
we will see that there are also predictions that cannot be so tested. The totality of all 
empirical predictions of a theory we also call the empirical content of the theory. 



3 



While GRWm and GRWf are designed to imitate quantum mechanics, they have 
been known since their inception to deviate from quantum mechanics, and a number of 
particular predictions differing from those of quantum mechanics have been identified 
[28l [39l [36l [311 [1] (for an overview of attempts to test GRW theories against quantum 
mechanics, see [^). Nonetheless, in practice the GRW theories tend to agree extremely 
well with quantum mechanics: for small systems, collapses are too rare to be noticed, 
while the breakdown of macroscopic superpositions is hard to test because of decoherence 
(for explicit figures about how closely GRW theories agree with quantum mechanics, see 
[8]). Thus, the theorems we prove are not precisely the axioms of quantum mechanics, 
but very close. 

Is there a general scheme of predictions, or an algorithm for directly calculating the 
predictions, of the GRW theories, in particular where they differ from quantum me- 
chanics? In this paper, we answer this question in the positive and provide a formalism, 
which we call the GRW formalism, summarizing the empirical predictions of the GRWm 
and GRWf theories. (Indeed, GRWm and GRWf give rise to the same formalism; they 
have to, because they are empirically equivalent.) The GRW formalism is analogous 
to the quantum formalism of orthodox quantum theory that describes the results of 
quantum experiments in terms of operators as observables, spectral measures, and the 
like. The main difference between the two formalisms lies in the relevant operators. 

The second main innovation of this paper, besides the formulation of the GRW 
formalism, concerns the nature of the argument used in deriving it: the argument is 
based on the primitive ontology of the theory. 

A third contribution of this paper is to identify questions that possess a unique 
answer in a GRW world but cannot be answered by the inhabitants of that world by 
means of any experiment. The following question is presumably of this type: How many 
collapses occurred in a certain system during the time interval [^1,^2]? 



1.1 A First Look at the GRW Formalism 

The GRW formalism can be formulated in a way similar to the formalism of quantum 
mechanics using operators in Hilbert space. We will give the complete formulation in 
Section [61 Put succinctly, the difference between the quantum and the GRW formalism 
is 

different evolution, different operators. 

"Different evolution" means that the unitary Schrodinger evolution is replaced by a 
master equation for the density matrix pt (a Lindblad equation, or quantum dynamical 
semigroup): 

^ = -i[H,p,] + xf2 I d'xK,{xf/^p,K,{xf'^-N\p,. (1) 
k=i ^ 

For readers who are not familiar with this type of equation, we note that the term 
— jj^[H,pt] represents the unitary evolution, with H the Hamiltonian, while the further 
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terms, the deviation from the unitary evolution, have the effect that the evolution ([T]) 
transforms "pure states into mixed states," i.e., transform density matrices that are 1- 
dimensional projections into ones that are not. Equation ([T]) holds for the density matrix 
Pt corresponding to the probability distribution of the random GRW wave function 
arising from a fixed initial wave function '^to- Concerning the notation, A > is a 
constant, and the positive operators A/(.(x) are the collapse rate operators (see Section [2] 
for the definition). 

"Different operators" means that "observables" are associated with different opera- 
tors than in quantum mechanics. This requires some explanation. A precise statement 
(which forms a crucial part of the GRW formalism) is that with every experiment S , 
there is associated a positive-operator-valued measure (POVM) E{-) such that the prob- 
ability distribution of the random outcome Z of S , when performed on a system with 
density matrix p, is given by 

F{Z e B) =tT{pE{B)) (2) 

for all sets This statement, the main theorem about POVMs, is valid in quantum 
mechanics as well as in GRW theories, but the POVM i?^^^(-) associated with S' 
in a GRWm or GRWf world is different from the POVM E^^{-) associated with (f in 
quantum mechanics. We prove this statement in Section |H However, we do not compute 
any specific operators for specific experiments, but derive only an abstract and general 
characterization of i?™^(-). 

When talking about every experiment, we mean that any possible future advances of 
technology are included. The assumptions that define our concept of "experiment" are: 
it involves a system (the object on which the experiment is performed) and an apparatus; 
it is possible to consider the same experiment for different states of the system, whereas 
changing the apparatus counts as considering a different experiment; at the time at 
which the experiment begins, the system and the apparatus are not entangled. 

Some colleagues that we have discussed this topic with have found it difficult to 
imagine how GRW could lead to different operators. When speaking of different opera- 
tors, we were asked, does that mean that the momentum operator is no longer —ifiVl 
No, it does not mean that. It means that, given any experiment in a quantum world, one 
can consider the same experiment in a GRWm or GRWf world, and the statistics of the 
outcome of that experiment are different from those in quantum mechanics — given by a 
different operator, or different POVM. Which operator should be called the "momentum 
operator" remains a matter of convention, and indeed there are reasons to call —iKV 
the "momentum operator" also in the GRW theories|§ Similarly, we would say that the 
"position observable" is the same in the GRW theories as in quantum mechanics, even 

"'^Here ¥{Z £ B) denotes the probability of the event Z £ B] sets are always assumed to be measur- 
able. The notion of "POVM" is defined in Section [XT] 

^Some "observables" of the quantum formalism — the momentum, angular momentum, and energy 
operators — are the generators of symmetries of the theory, such as translation, rotation, and time 
translation invariance. By virtue of Noether's theorem, then, they commute with the Hamiltonian. 
Since GRWm and GRWf, too, are translation, rotation, and time translation invariant (if the interaction 
potential is), the same self-adjoint operators occur here in the role of generators of symmetries (and 



5 



though concrete experimental designs for "measuring position" may lead to different 
outcome statistics than in quantum mechanics. 

We were also asked, when speaking of different operators, whether we refer to the 
Heisenberg picture? No, we do not. The question means this: If the time evolution is not 
unitary then the Heisenberg picture (or whatever replaces it for a master equation such 
as ([1])) should attribute to all observables different operators than standard quantum 
mechanics. But the "different operators" arise even in the Schrodinger picture: If the 
observation of the system (i.e., the period of its interaction with the apparatus) begins 
at time s and ends at t, then one is supposed, according to the GRW formalism, to 
evolve the system's density matrix until time s using ([1]) in the Schrodinger picture, and 
insert into the formula ([2]) the resulting ps, corresponding to what one feeds into the 
apparatus]^ 

Maybe the reason why many physicists find it difficult to understand that the GRW 
formalism involves different operators arises from regarding the operators of quantum 
mechanics as something that came into the theory by means of a second postulate besides 
the Schrodinger equation, the measurement postulate. From such a picture one might 
expect that the measurement postulate should remain unchanged, and, hence, also the 
operators, even when the Schrodinger equation is modified. The GRW perspective, 
however, forces us to proceed differently since it contains no measurement postulate, 
and predictions must be derived instead from postulates about the primitive ontology. 
This makes it evident that the measurement postulate and the Schrodinger equation 
actually never were independent, and that the operators depend on the evolution law, 
for example because the experiment's outcome depends on the evolution law of the 
apparatus. The GRW perspective also forces us to make precise what it means to say 
that a certain observable has operator A. We take it to mean that A encodes the outcome 
statistics, in the sense that the relevant experiment has outcome statistics given by ([2]) 
with E{-) the spectral projection-valued measure (PVM) of A. 

The master equation ([T]), or very similar equations, also arise in the theory of de- 
coherence. As a closely related fact, the GRW formalism would in principle also hold 
in a hypothetical quantum world in which decoherence is inevitable and affects every 
system in the same way, corresponding to ([1]). (In practice, of course, decoherence, 
due to interaction with the environment, cannot correspond to ([1]) in exactly the same 
way for every system because different systems have different environments and inter- 
act with their environments in different ways.) Let us underline the difference between 
deriving the GRW formalism from the quantum formalism together with the right dose 
of decoherence corresponding to ([1]), and deriving it from GRWm or GRWf: A deriva- 

commute with the Hamihonian) , even though a particular experiment that "measures," in quantum 
mechanics, momentum, angular momentum, or energy may, in the GRW formalism, be associated with 
different operators. 

•^But some connection with the Heisenberg picture exists indeed: keep in mind that the main theorem 
about POVMs concerns any experiment S'; for example, S' could consist of waiting for a while At and 
then "measuring position." Then, the quantum operator associated with is the Heisenberg-evolved 
position operator, Qg — e^^^*Qe~'^^'^* , and the reader will be able to imagine that in GRWm or GRWf 
there is a different operator (in fact, a POVM) associated with S". 
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tion starting from quantum mechanics would assume statements about the outcomes 
of experiments (the measurement postulate) to deduce other statements about the out- 
comes of experiments. When starting from GRWm or GRWf, in contrast, we assume 
statements about the primitive ontology, and derive that, e.g., pointers point in certain 
directions. 

It is an interesting side remark that Bohmian mechanics [TU [11] can be so modified 
as to become empirically equivalent to GRWm and GRWf. This modified version is 
described in |1] under the name "MBM." Its empirical content is also summarized by 
the GRW formalism. As a consequence, the empirical content of the GRW theories can 
as well be obtained with a particle ontology, and is not limited to the flash and matter 
density ontologies. 

1.2 Role of the Primitive Ontology 

What is the connection between empirical predictions and primitive ontology (PO)? 
Since the PO is described by the variables ^ giving the distribution of matter in space 
and time, a statement like "the experiment ^ has the outcome z" should mean that 
the PO of the apparatus indicates the value z. For example, if the apparatus displays 
the outcome by a pointer pointing to a particular position on a scale, what it means for 
the outcome to be z is that the matter of the pointer is, according to the PO, in the 
configuration corresponding to z. Thus, the outcome Z is a function of the PO, 

z = C(0 • (3) 

Precursors of our treatment of the connection between predictions and PO can be 
found in [9l |29l HOl HH [161 [21 [31 [7], in some of which this connection was implicit, or 
hinted at, or briefly mentioned. In Bohmian mechanics [IHITT], a similar connection be- 
tween PO and the empirical predictions was explicitly made in [22]; however, researchers 
working on Bohmian mechanics have essentially always been aware of this connection — 
much in contrast to those working on collapse theories, who tended to focus on the wave 
function and forget about the PO. 

The fact that GRWm and GRWf have the same formalism, despite their difference 
in PO, may suggest that the PO is not so relevant after all. That is true for practical 
applications which require working out the figures of the predictions, but not for the 
theoretical analysis of GRW theories, for their logical structure, or for their definition, 
as the considerations in this paper exemplify. 

1.3 Status of the Derivation 

It may seem as if the GRW formalism were a rather trivial consequence of the master 
equation ([ID. So it is perhaps useful to make a list of what is nontrivial about our 
derivation of the GRW formalism: 

• It is not a priori clear that a GRW formalism should exist. 
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— The existence of a GRW formalism had not been noticed for 20 years. 

— Since the predictions of GRWm and GRWf deviate from those of quantum 
mechanics, it is not obvious that they can be summarized by any small num- 
ber of simple rules. 

— The derivation of the GRW formalism has a status similar to that of the quan- 
tum formalism from Bohmian mechanics (see, e.g., [25]), a result implying 
in particular that there is no possibility of experimentally testing Bohmian 
mechanics against standard quantum mechanics. If that claim is non-obvious 
(after all, some authors have claimed the contrary), then so should be the 
GRW formalism. 

— The non-linearity of the GRW evolution of the wave function might have 
suggested against the existence of a GRW formalism using linear operators. 
On the other hand, the master equation ([1]) is linear in pt, a crucial fact for 
deriving the GRW formalism. Still, this fact alone does not imply the GRW 
formalism 

• Our assertion about the GRW formalism concerns the PO. In detail, it states 
that the matter density function m{x,t) of GRWm and the set F of flashes in 
GRWf are such that macroscopic apparatuses display certain results with certain 
probabilities. 

— Our derivation of the GRW formalism is based on an analysis of the behavior 
of the PO. Such an analysis was not done in (TJ [S]. 

— Our derivation applies to the matter density ontology and to the flash ontol- 
ogy. We do not make claims for any other ontologya 

— The defining laws of GRWm and GRWf, unlike the ordinary axioms of quan- 
tum mechanics, do not refer to observations, but to the wave function and 
the PO. Thus, the empirical predictions are not immediate from the defining 
laws of the theory but require a derivation. 

— To the extent that it is not obvious how the PO variables m{x,t) and F be- 
have, it is not obvious how macroscopic apparatuses (built out of the elements 
of the PO) behave. 

— It has often been noted that there are situations in which m{x,t) and F 
behave in an unexpected, surprising, or counter- intuitive way. (See, e.g., [HI 
p. 347], [3i footn. 5].) 

^For example, we do not know of a way of deriving the GRW formalism from GRWm other than 
exploiting the empirical equivalence to GRWf (or MBM j4j), even though ([T]) is valid in GRWm. 

^However, there are reasons why every reasonable ontology suitable for the stochastic GRW wave 
function evolution law should lead to the same empirical predictions. Similarly, the empirical contents 
of CSLm, the Continuous Spontaneous Localization theory 35, 27 . 6J with the matter density ontology, 
or with any other reasonable ontology, can presumably be summarized by a formalism very similar to 
the GRW formalism. 
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• Every physicist knows rules for what can be concluded about measurement results 
if the wave function is such-and-such. These rules, however, cannot be used in 
the derivation of the GRW formalism, partly because the GRW theories are not 
quantum mechanics, and partly because it is the aim of the derivation (and of this 
paper) to deduce, and not to presuppose, rules for the results of experiments. 

— Our derivation makes no use of the rules of standard quantum mechanics for 
predicting results of experiments given the wave function. 

— Our derivation makes no use of any customs of standard quantum mechanics 
for how to interpret or use wave functions. 

— In particular, operators as observables emerge from an analysis of the GRW 
theories, they are not postulated; in fact, they are not even mentioned in the 
definition of the GRW theories. 

— Certain wave functions may easily suggest certain macro-states, but this does 
not mean that the configuration of the PO looks like this macro-state. Our 
derivation makes no use of such suggestive assumptions. 

• As a consequence of our analysis, there are severe limits on the epistemic access 
to microscopic details of the PO variables m{x,t) or F. In other words, there 
are limits to the extent to which one can measure m{x,t) or F. This fact can be 
regarded as an instance of surprising behavior of the PO (as mentioned above), 
and underlines that it is not obvious which functions of the PO are observable. 

The issue we mentioned in the last item of the list deserves more comment. It 
turns out to be impossible to measure, with any reasonable microscopic accuracy, the 
matter density m(a;, t) in GRWm (or, presumably, the set F of flashes in GRWf), unless 
information about the wave function of the system is available. Limitations on the 
observers' access to m(x, t) were described before in [13]; we describe here several similar 
limitations. As a particular example, one might wish to measure the number of collapses 
that occur in a certain system (e.g., a tiny drop of water) during a chosen time interval, 
in analogy for example to the measurement of the number of radioactive decay events in 
a sample of radioactive matter. Heuristic considerations suggest, perhaps surprisingly, 
that it is impossible to measure the number of collapses, with any accuracy and reliability 
better than what one could estimate without any measurement at all. In other words, 
the precise number of collapses is empirically undecidable, and thus GRWm and GRWf 
entail sharp limits on knowledge. In a GRWm or GRWf world, certain facts are kept 
secret from its inhabitants. Note that this situation does not arise from anything like 
a conspiratorial character of the theory, but simply as a consequence of the defining 
equations; after all, we do not make postulates about what can or cannot be measured 
but analyze the theory. Similar limits on knowledge are known for Bohmian mechanics, 
where for example it turns out to be impossible to measure the (instantaneous) velocity 
of a particle [251126] . unless information about the wave function is available; as another 
example, it turns out to be impossible to distinguish empirically between certain different 
versions of Bohmian mechanics (see |3D] for a discussion). 
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A question we do not address here is how to do scattering theory for GRW theories. 
But we briefly state the problem. Normal quantum scattering theory (see, e.g., [22]) 
involves limits t oo, which would be inappropriate in GRW theories because one 
consequence of GRW theories is long run "universal warming," since every collapse tends 
to increase energy, as it makes the wave function narrower in the position representation 
and therefore wider in the momentum representation. In the limit t — >■ oo, scattered 
wave packets in a GRW world would therefore always end up with infinite energy, and 
uniformly distributed over all spatial directions. From a practical point of view, the time 
scale of free flight in real scattering experiments (~ 10^^ s) is much smaller than the 
time scale of universal warming (~ 10^^ years p8l p. 481]), usually even much smaller 
than the time scale of collapse (~ 10^ years), but much larger than the time scale of 
the interaction process. Thus, a simple and quite appropriate method of predicting the 
scattering cross section in a GRW world is to take the limit t oo for the unitary 
evolution, which is the dominating part of the evolution of the wave function \E't over 
the relevant time scale. But this is to ignore the difference between the predictions of 
GRW theories and quantum mechanics for scattering theory, and the question remains 
how to compute GRW corrections to the quantum formulas for scattering cross sections. 

Another question we do not address here is: Which condition characterizes the regime 
in which the GRW theories empirically agree with quantum mechanics? 

Finally, although the GRW formalism is valid for both GRWm and GRWf, the 
status of the derivation is very different for the two theories. While we derive the GRW 
formalism as precise theorems from GRWf, we do not know of a similar derivation from 
GRWm. In fact, the only way we know of to derive it for GRWm is by exploiting the 
empirical equivalence with GRWf, and the argument for the empirical equivalence is not 
as mathematical in character as the derivation of the GRW formalism from GRWf. 

2 The GRWm and GRWf Theories 

GRWm was essentially proposed by Ghirardi et al. [13] and Goldstein [29], and taken 
up in [6l[2l[3ll[T6lll2l[3l[7l[H]. GRWf was proposed by Bell in ^ and taken up in 
[I3[32l[29lll0l[2l[3ll[T6lllTlll2l[3ll^ For a detailed discussion of these two choices of 
PO see [3]. Both GRWm and GRWf are non-relativistic theories. The relativistic GRWf 
theory proposed in [ID] has a more complex mathematical structure than GRWf and is 
not covered by the considerations in this paper. A third type of PO was proposed for 
collapse theories on lattices by Dowker et al. [TH [20l [21] , which will not be considered 
here. 

2.1 The GRW Jump Process in Hilbert Space 

In both GRWm and GRWf the evolution of the wave function follows, instead of the 
Schrodinger equation, a stochastic jump process in Hilbert space, called the GRW pro- 
cess. We shall summarize this process as follows. 



10 



Consider a quantum system of (what would normally be called) "particles," de- 
scribed by a wave function \E' = \l'(xi, . . . , x^), Xi e R'^, i = 1, . . . , N . For any point x 
in R^, define on the Hilbert space of the system the collapse rate operator 



where Qi is the position operator of "particle" i. Here a is a new constant of nature of 
order 10~''m. 

Let \&fo be the initial wave function, i.e., the normalized wave function at some time 
to arbitrarily chosen as initial time. Then evolves in the following way: 

1. It evolves unitarily, according to Schrodinger's equation, until a random time 
Ti = to + ATi , so that 

^Ti = ^^ATi^to, (5) 

where Ut is the unitary operator Ut = e~Ti^^ corresponding to the standard Hamil- 
tonian H governing the system, e.g., given, for spinless particles, by 

k=l 

where m^, k = 1, . . . ,N, are the masses of the particles, and V is the potential 
energy function of the system. ATi is a random time distributed according to the 
exponential distribution with rate A^A (where the quantity A is another constant 
of nature of the theoryj§ of order 10"^^ s~^)- 

2. At time Ti it undergoes an instantaneous collapse with random center Xi and 
random label Ji according to 

Ii is chosen at random in the set {1, . . . , A^} with uniform distribution. The center 
of the collapse Xi is chosen randomly with probability distribution 

P(Xi G dxi\^T,Ji = ^i) = (^TjAn(a;i)|^Ti)rfxi = ||A,,(si)^/2^rJ|'rfxi. (8) 

3. Then the algorithm is iterated: \E'ti+ evolves unitarily until a random time T2 = 
Ti + AT2, where AT2 is a random time (independent of ATi) distributed according 
to the exponential distribution with rate A^A, and so on. 



^Pearle and Squires [35] have argued that A should be chosen differently for every "particle," with 
Xi proportional to the mass rtii. 
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Thus, if, between time to and any time t > t^, n collapses have occurred at the times 
to < Ti < T2 < . . . < Tn < t, with centers Xi, . . . , X„ and labels /i, . . . , /„, the wave 
function at time t will be 

where = {(Xi, Ti, /i), . . . , T„, /„)}, and 

(The scalar factor in the first line will be convenient for future use.) Since Tj, Xi, li 
and n are random, \E't is also random. We will also call the collapsed wave function, 
particularly when in need to contrast it with the "uncollapsed" wave function Ut-to '^to- 
ll should be observed that — unless to is the initial time of the universe — also '^to 
should be regarded as random, being determined by the collapses that occurred at 
times earlier than to- However, given '^to-, the statistics of the future evolution of the 
wave function is completely determined; for example, the joint distribution of the first 
n collapses after to, with particle labels Ji, . . . , J„ G {1, . . . , A^}, is 

P(Xi e dXi,Ti e dti, h = ii, . . . ,Xn e dXn, Tn G dtn, In = inl'^to) = 

WHfn) "^toW^ dXidti ■ --dXndtn, (H) 

with /„ = {(xi, ti, ii),..., {xn, tn, in)} and 

I^(/n) = l.o<t.<...<*„A"/2e-^^(*"-*o)/2x 

X A.„(x„)^/' f/*„-t„_,A.„_,(x._i)^/2 f/*„_i-t„-. ■ ■ ■ An(^i)'^' U,,.,,. (12) 



(The symbol Ic is 1 if the condition C is satisfied and otherwise.) The expression (fT2il 
is the same as L^t^^t^^^fn) defined in ffTOj) (and is explicitly set to zero if the t^ are not 
ordered increasingly). 

We have described the law for the evolution of the wave function. We now turn 
to the primitive ontology (PO). In the subsections below we present two versions of 
the GRW theory, based on two different choices of the PO, namely the matter density 
ontology (in Section [2^ and the flash ontology (in Section [273|1 . 

2.2 GRWm 

In GRWm, the PO is given by a field: We have a variable m{x, t) for every point x G M'^ 
in space and every time t > to, defined by 



m(a;,t) = ^^mj / dxi- ■ ■ dx^ — x)\^ ti^i-, ■ ■ ■ ,xm)\ ■ (13) 
i=i 
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In words, one starts with the |\l/p-distribution in configuration space M^^, then obtains 
the marginal distribution of the z-th degree of freedom Xi G by integrating out all 
other variables Xj, j ^ i, multiplies by the mass associated with Xi, and sums over i. 
Alternatively, (fT3|) can be rewritten as 

m{x,t) = {^t\Hx)\^t) (14) 

with A(a;) = ^{Qi - x)- 

The field m{-,t) is supposed to be understood as the density of matter in space 
at time t. GRWm is a theory about the behavior of matter with density m{-,t) in 
three-dimensional space. 

2.3 GRWf 

According to GRWf, the PO is given by "events" in space-time called fiashes, mathe- 
matically described by points in space-time. In GRWf, histories of matter are not made 
of world lines but of world points. The fiashes form the set 

F = {(Xi,ro,...,(x,,T,),...} 

(with Ti < T2 < . . .), or, when we consider labeled flashes, 

P = {{^iiTi, h), . . . , {Xk,Tk,Ik), ■ ■ ■} 

with Jfc G =Sf = {1,...,A^}, the set of labels. The GRWf law of the fiashes asserts 
that there is a fiash at the center {X, T) of every collapse, with the appropriate label. 
Accordingly, equation ffTTj) gives the joint distribution of the first n fiashes, after some 
initial time to- 

Note that if the number N of the degrees of freedom in the wave function is large, as 
in the case of a macroscopic object, the number of fiashes is also large (if A = 10^^^ s~^ 
and N = 10^'^, we obtain a rate of 10® fiashes per second). Therefore, for a reasonable 
choice of the parameters of the GRWf theory, a cubic centimeter of solid matter contains 
more than 10® fiashes per second. Such large collections of fiashes can form macroscopic 
shapes, such as tables and chairs. That is how we find an image of our world in GRWf. 

We should add that the mathematical scheme of GRWf that we have introduced 
here is not the most general one possible. The fiash rate operators A(x) do not have 
to be of the form (jlj) but could be other positive operators [3T], they could depend on 
time, A (a;) = At{x), and they could even be allowed to depend on the previous fiashes 
|45j . (The latter case occurs in the relativistic GRWf theory presented in |1Q].) The 
considerations in this paper are still valid if the A (a;) are other positive operators than 
in (jl]) and if they depend on time, but we do not consider the case in which they depend 
on the previous fiashes. For the sake of concreteness readers can simply take A(x) to be 
the multiplication operators (jlj). 
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2.4 Empirical Equivalence 

As already remarked, it is known that GRWf and GRWm are empirically equivalent, 
i.e., they make always and exactly the same predictions [3J. In other words, there is no 
conceivable experiment (including future advances in technology) that could distinguish 
between GRWf and GRWm. This follows from the following even stronger statement: 
When applying the flash ontology and the matter density ontology to the same wave 
function \1/ obtained from the GRW process, the two PO histories are macro-history 
equivalent, i.e., all macroscopic facts come out the same way. 

Let us elaborate on this statement. What we mean is to consider a realization of the 
GRW jump process in Hilbert space as described in Section [2?T] (that is, for every t), 
and then both the GRWm world and the GRWf world associated with this defined 
by m{x,t) as in f[T^ for every t, respectively by putting a flash at the center of every 
collapse of What we mean by macro-history equivalence is that the macroscopic 
world history is the same in both worlds, including, e.g., the weather in a particular 
place at a particular time, lottery numbers, and more generally the exact sequence of 
outcomes of any experiment. This is more than empirical equivalence, as the latter 
requires not that all random events come out the same way in two worlds, but only that 
the outcome statistics are the same, or, put differently, that one cannot conclude from 
the two macroscopic histories which one is governed by which of two theories. Clearly, 
macro-history equivalence implies empirical equivalence. 

For GRWf and GRWm, macro-history equivalence holds with overwhelming proba- 
bility. That is, although there do exist wave functions \l/ for which the macroscopic facts 
in the GRWf world are different from those in the GRWm world, such wave functions 
are extremely improbable to come out of the GRW process. 

Here is the argument. It suffices to consider a macroscopic amount of matter, which 
we call the "pointer" (though it could also be, e.g., the shape of ink on paper), that 
can either be in position 1 or position 2 at time t, and a wave function of the form 

= Ci$i + C2$2) where is concentrated on configurations in which the pointer is 
in position i; we assume ||$j|| = 1 and |cip + |c2p = 1. If, in GRWm, the matter of 
the pointer is in position 1, then this means that m{l,t) ^ m{2,t); thus, |cip ^ |c2p; 
thus, it is much more likely that flashes occur at position 1 than at position 2; thus, 
with probability near 1, in GRWf the matter is also in position 1. 

To appreciate how close to 1 this probability is, recall that, as a consequence of the 
GRW process for it is overwhelmingly likely that either |cip or |c2p will become 
exorbitantly small within a fraction of a second (see ^ for concrete numbers in a realistic 
scenario). Note also that, in the unlikely event that many flashes occur in position 2 
between t and t -\- At and thus create a discrepancy between the pointer position in 
GRWf and that in GRWm, the associated collapses would shrink the size of Ci to a 
considerable extent; so much indeed, if the number of flashes in position 2 is sufficient, 
that \ci{t -\- At)p is close to zero and \c2it + At)p close to 1; as a consequence, m(l, t -\- 
At) ^ m(2, t-\-At). That is, even in the unlikely event of a discrepancy, it cannot persist 
longer than the time it takes the collapses centered at position 2 to make |ci(t + At)p 
small. 
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2.5 Systems 



Since we have not specified, in the definition of the GRW theories, which kinds of systems 
the defining equations, such as ([9]) through (fT2|) . apply to, they a priori apply only to 
the universe as a whole. For any system, being a subsystem of the universe, equations 
of the same kind may or may not apply, but there is no need, and indeed no room, 
for postulates about this because the equations for the universe will determine what 
is true about any subsystem. Hence, the wave function \1/ we were talking of is the 
wave function of the universe. However, in our analysis of the empirical predictions of 
GRWm and GRWf, we will have to consider systems: the system corresponding to those 
instruments which comprise the apparatus for the experiment and, most importantly, 
the system upon which the experiment is performed. For this, it will be helpful to 
formalize the notion of system, as well as that of the wave function of a system. 

To begin to approach such a notion, note that usually a system corresponds to some 
of the "configuration variables" in the wave function, 

* = ^(g) = ^(gsys,genv) (15) 

where q = {xi, . . . ,xn) is the configuration variable of the universe, gsys that of the 
system, and genv that of its environment (the rest of the world); defining a system 
amounts to splitting the universe in two parts, the system and its environment. For 
example, Qsjs may correspond to a certain collection of "particle variables" , say 

Qsys = {xi,..., xm) and genv = (a^M+i, ■ • ■ , xn) ■ (16) 

Since for the GRW theories, the configuration variables do not play a fundamental 
role, our mathematical definition of "system" is formulated in different terms, namely 
in terms of the Hilbert space and of the primitive ontology. 

For our purposes, a system is defined by two ingredients: 

• A splitting of Hilbert space according to 

= Jifsys ® J^env ■ (17) 

For example, such a splitting is provided by ffT^ according to J^sys = L'^ilsys), 

J^env = i^'(genv), and = L^q). 

• A splitting of the PO; this means, in GRWf, a splitting of the fiashes according to 

F = Fsys U Fenv , -f^sys H Fcnv = , (18) 

or, in GRWm, a splitting of the matter density according to 

t) + t). (19) 

In each of the two theories, we assume that the splitting is defined either through 
a subset ^sys C ^ of the set of labels (corresponding to different types of 
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flashes/collapses), or through a region i?sys ^ in space, or a combination of 
both: In GRWf, a flash belongs to F^ys if and only if it occurs in i?sys O'lT'd its label 
belongs to ^sys', -fenv '■= F \ Fsys- In GRWm, nisys is the contribution to m{x,t) 
from labels in =Sfsys at locations in -Rgys: 



x, t) = l^eR^y^ mj y" dxi ■ ■ ■c?XAr5(a;i - x) |^t(a:i, . . . ,XAr)|\ (20) 



and menv = m — rrisys- The set Fgyg C F of the system's flashes may happen to 
be empty, but even in that case the definition of the system in terms of ^sys and 
Rgys will be useful. In the example of f|T6l) . ^sys = {1, ■ • ■ ,M}, while Rgys does 
not play a role. 

The example provided by f|T5l) suggests that everything that could be considered 
a system in orthodox quantum mechanics also defines a system in the sense of our 
definition. 

We say that the system has wave function ipsys if the wave function of the universe 
factorizes according to 

^' = 1p,ys ® ^env (21) 

with ipsys £ '^sys sud ipenv ^ '^nv Siucc it follows that uot cvcry system has a wave 
function at every time, it will also be useful to say that the system has reduced density 
matrix psys if 

Psys = trenv|^)(^| (22) 

with trenv the partial trace over J^env 

We call a system a GRW system if it has an autonomous GRW dynamics, i.e., if it 
behaves as if it were alone in the universe. We postpone the exact definition of what 
that means to Section 17.1. 2( there we will also show that a system is a GRW system if 
and only if it does not interact with its environment. 



3 Mathematical Tools 
3.1 POVM 

Recall that, while many quantum experiments are associated with self-adjoint operators, 
this is not the most general case, which corresponds to positive- operator-valued measures 
(POVMs, also known as "generalized observables" ; see [17J and Section 4 of [25] for an 
introduction). Leaving out some technical details of the mathematical definition, we 
recall that a POVM on the set Q acting on M' is a mapping 

E:A^ C{.^) (23) 

from a cr-algebra A over Vt (the family of all subsets of VL regarded as "measurable") 
to the space of bounded operators on the Hilbert space with the properties that 
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(i) E{B) is a positive self-adjoint operator for every B E A, (ii) E{Q) = I, the identity 
operator, and (iii) E{-) is a-additive, i.e., for pairwise disjoint Bi, B2, . . . E A 

00 00 

E{[jB,)=J2E{Bk). (24) 

k=l k=l 

(All subsets and functions we consider will be assumed to be measurable with respect to 
the relevant cr-algebras. A positive operator S with S < I is also called an effect in the 
literature [33], and a POVM also an effect-valued measure.) By virtue of the spectral 
theorem, the self-adjoint operators correspond to special POVMs, the projection- valued 
measures (PVMs) on the real axis. In many cases relevant to us, ^2 will be a finite or 
countable set; in that case, the POVM is determined by the operators associated with 
singleton sets, = E{{u}), according to 

E{B) = Y,E^, (25) 
uieB 

and any collection of positive operators {E^)^^q such that 

J2e^ = I (26) 

defines a POVM. We will thus often identify the POVM with the collection (ii^^)^^^^. 

The following two very simple observations about POVMs will be used in the course 
of this paper: 

Function Property. // the distribution of the random variable X depends on a 
system's wave function via a POVM D{-), ¥{X E A) = {tp\D[A)\ijj) , and if the 
random variable Y is a function of X , Y = f{X), then the distribution of Y is also 
given by a POVM: 

F{Y e B) = {iIj\E{B)\^) with E{B) = D{f-\B)) . (27) 

Reduction Property. If D(-) is a POVM on f2 acting on J^i ® M2, and if (f) E 

has = 1, then the partial scalar product 

E{B) = {<P\D{Bm (28) 
defines a POVM E{-) on Q acting on M\. 

3.2 The Distribution of the Flashes 

In GRWf, the joint distribution of all flashes, as a functional of the initial wave function 
\E'jo, is given by a POVM G(-), called the history POVM. Let us elaborate on this 
statement. 
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Reformulating ffTTl) . the joint distribution of the first n flashes is given by a POVM 
Gn{-) on VLn = (ffi'^ X M X (where M.^ represents space, M time, and ^ is the set of 
labels) , 

e dfn) = {^to\Gnidfn)\'^to) (29) 

with dfn = dxidti ■ ■ ■ dxndtn a "volume element" around = {(xi, ti, zi), . . . , (x„, z„)} 
in n„ and 

Gn{B)= [ d/„L*(/„)L(/„) (30) 

with L{fn) defined by f|T2|) (and L* denoting the adjoint of L). It is easy to convince 
oneself that Gn(') is a POVM; see JUj for a rigorous proof. 

It is no surprise now that also the joint distribution of all flashes is given by a POVM 
(?(■); see [H] for a rigorous proof. The space on which G{-) lives is Q^t^^^oo), where ^[s,t) 
means the space of all histories of flashes in the time interval [s,t) (possibly t = oo), 
i.e., the set of all discrete (finite or countable) subsets of x [s,t) x =Sf. 

Finally, consider -F[s,t), the set of all flashes during the time interval [s,t). Since 
it trivially is a function of F, the set of all flashes, by the function property (1271) its 
distribution is given by a POVM G'[s,t)(-) on Q[s,t)- We can specify it explicitly for s = to 
and t < oo: 

G[,„,,)(i?)= / rf/Lf,„,)(/)L[,,,)(/) (31) 
Jb 

with df the Lebesgue measure, which exists on the space ^'^^f-j of finite flash patterns 
because it exists on each sector 

^!to,t) = {/ C X [to, t) X ^ : #/ = n} . (32) 
Note that, since is almost surely finite for t < oo, G[tg^t) is concentrated on ^^^^-^ = 

I lOO on 

u„=o"[to,t)- 



3.3 The Conditional Probability Formula 

Set, for the ease of notation, to = 0. A simple and important consequence of the 
distribution law ffTTl) of the flashes is the conditional probability formula, which asserts 
that, for < s < t < oo, 

P*o {F^s,t) e B I F[o,,)) = P^,, eB). (33) 

Here, P,i,p means the distribution obtained starting from the wave function \E'o) and Pij,^ 
the one obtained starting from at time s. Note that the dependence on -F[o,s) of the 
right hand side is through "^g, which is a function of -F[o,s)- In words, the conditional 
probability formula asserts that the conditional distribution of the flashes after time s, 
given the flashes before s, coincides with the distribution obtained from starting the 
universe at time s with wave function \I's. 

This formula is the ultimate reason why it is natural in GRWf to regard the collapsed 
(GRW) wave function \E's as the wave function at time s: because the distribution of the 
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future flashes after s (given that the past was what it was) agrees with the distribution 
arising from as the initial wave function at time s. 

The conditional probability formula follows from the Markov property of the stochas- 
tic GRW process \E't, defined by 

¥^,{^t e B\^s' =i^s'ys' G [0,s]) =n,{^t e B\^s = ^s) (34) 

for every t > s. The Markov property means that the process is memoryless. To see 
how the conditional probability formula follows, note first that the history of the wave 
function between and s is determined by (and, conversely, determines) the flashes 
between and s, so that conditioning on \E's/ = ips'^s' G [0, s] amounts to the same 
thing as conditioning on -F[o,s) = f[o,s)- Furthermore, the Markov property implies (by 
varying s) that (1341) still holds when the event \E'f G -B is replaced with the event 
{"^ti, ■ ■ ■ , ^tfc) G -B' for s < ti < . . . < tk, or in fact any event about the future after s. 
Again, the future history of the wave function is in one-to-one correspondence with the 
future flashes, so that 

P^„(F[,,t) G 5|F[o,.)) = P*o(^[.,t) e B\^s) , (35) 

and the right hand side coincides with the right hand side of fl33l) . Alternatively, an 
algebraic-analytic derivation of the conditional probability formula can be found in Ap- 
pendix \M 

4 How Operators Emerge 

We will formulate and derive the GRW formalism in Section [61 At this stage, we can 
already understand, in a particularly easy way, how operators emerge from GRWf, and 
that is why we present this aspect first. 

We give a simple derivation for the main theorem about POVMs in GRWf, i.e., for 
the fact that in GRWf, as in quantum mechanics, there is a POVM E{-) for every 
experiment, so that the probability distribution of the outcome of the experiment, when 
performed on a system with wave function ip, is given by {%l)\E{-)\il)). To appreciate the 
substance of this derivation it is relevant to realize that the definition of GRWf did not 
mention operators as observables. Thus, operators as observables were not put in, they 
come out by themselves. 

Many physicists find such a situation hard to imagine, and that is why this point 
deserves a separate section. Many physicists are used to thinking that the central role 
of operators in quantum theory, particularly in view of their non-commutativity, con- 
stitutes a crucial departure from classical physics, and, even more, from any kind of 
theory describing an objective reality, or any kind of theory that can be understood as 
clearly as a classical theory. According to this widespread view, the non-commutativity 
of operators entails that reality itself is paradoxical and will forever remain incompre- 
hensible to us mortals. This view is often connected to the key word "complementarity." 
Now the shocking result is that the same non-commuting operators appear in GRWf, a 
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theory describing an objective reahty which can indeed be understood as clearly (and 
as easily) as a classical theory! 

This is not so shockingly new since the same can be said of Bohmian mechanics (see, 
e.g., pn [25] ). and since it has been clear for 20 years that GRW theories make almost 
the same predictions as quantum mechanics [281 E] . Nonetheless, it is worthwhile to get 
a good grasp of how exactly this can be so, how non-commuting operators can emerge 
from a theory describing non-paradoxical reality. 

Here is the derivation. Recall from Section 13.21 that the joint distribution of all 
flashes after time t is given by a POVM G[f,oo)(') on the appropriate space fl[t,oo) of 
flash histories and the wave function of the universe at time t. Let t be the time at 
which the experiment begins. Consider splitting the universe into a system (the object 
of the experiment), the apparatus of the experiment, and the rest of the world. It so 
happens that for the argument that follows, the division between apparatus and the rest 
of the world is irrelevant, so we put the two together and call them the environment 
(of the system). The division between the system and its environment corresponds to 
a splitting of the Hilbert space into = J^ys ® J^env', the splitting F = Fgys U Fgnv of 
the flashes is not needed in this section. We assume independence between the system 
and the environment immediately before t, so tha10 

^j = ^®0. (36) 

Here is fixed, being part of the characterization of the experiment, while ip, the initial 
state of the system upon which the experiment is performed, is allowed to vary in the 
system Hilbert space Jifgys- The outcome Z of the experiment is a function of the pattern 
F of flashes, 

Z = C(F) (37) 

with ( : Q[t^oo) ^ 1 where 2^ is the value space of the experiment. That is so because 
the flashes define where the pointers point, and what the shape of the ink on a sheet of 
paper is. (It would even be realistic to assume that Z depends only on the flashes of 
the apparatus, but this restriction is not needed for the further argument.) Therefore, 
the distribution of the random outcome Z is given by 

¥{ZeB)=¥{FeC\B)) = {^t\GoC\B)\^t) = {m{BM V5C^, (38) 

^Readers may worry that the factorization condition (j36p never holds because of the symmetrization 
postulate: As soon as both the system and the apparatus contain electrons, the wave function has to 
be anti-symmetric in the electron variables Xi, so that a splitting as in (|16p . grouping some variables 
Xi together as "system variables" and others as "environment variables," would conflict with ([36]) . The 
answer is, (|36|) can hold nevertheless, as follows: For identical particles, the indices of the variables 
a;i, . . . , a;Ar are mere mathematical labels, and the splitting into system and environment should not be 
based on these unphysical labels but instead on regions of space. Indeed, if i?sys C is a region of 
space such that both i?sys and M.^ \ i?sys have positive volume then J^{M.^) = ^(i?sys) ® \ Rsys), 

where Jif{S) is the fermionic (or bosonic) Fock space over L^{S). Since a fermionic wave function can 
be represented by a vector \l/ G J^{R^), it can indeed factorize in the splitting based on i?sys. 
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where the first scalar product is taken in the Hilbert space of the universe and the second 
in the Hilbert space of the system (i.e., the object of the experiment), and E{-) is the 
POVM given by 

E{B) = {<p\GoC\B)\<p) yBC3f, (39) 

where the scalar product is a partial scalar product in the Hilbert space of the environ- 
ment. Thus, for every experiment in GRWf the distribution of outcomes is given by a 
POVM E{-) on which is what we wanted to show. 

At this point, we would like to go through the derivation again, carefully keeping 
track of the ingredients in the argument: 

• The distribution of flashes in GRWf is given by a POVM G{-). In more detail: 

— G{-) is a POVM on the total Hilbert space = M'sys ® ^nv, where J^sys is 
the Hilbert space of the system and that of its environment, including 
all apparatus. 

— What we really want is, of course, the conditional distribution of the flashes, 
given what happened up to the time t when the experiment begins. By 
the conditional probability formula fl33|) . this distribution is (\&i|G[t_oo)(')l^<) 
with the (collapsed) wave function at time t. 

• The outcome Z of an experiment in a GRWf world must be a function of the 
flashes (usually, just of the flashes belonging to some apparatus), Z = C{F). 

• By the function property fl27|) of POVMs, the distribution of the outcome is also 
given by a POVM on Jf. 

• Consider a particular setting of the experiment, as encoded in G ^cnv] ask for 
the dependence of the distribution of the outcome Z on the wave function ip G J^sys 
of the object. In particular, assume factorization, = ip ^ (p. 

• By the reduction property fl28l) of POVMs, the distribution of Z as a function of 
ip is given by a POVM E{-) on ^ys. 

We close this section with a few remarks. 

• The POVMs corresponding to different experiments may well, and typically will, 
not commute. Even the single POVM E{-) may be non-commuting, in the sense 
that E[Bi) does not commute with E{B2) for suitable sets Bi,B2 C The 
simple derivation above, just a few lines long, shows how non-commuting operators 
can emerge from a picture of reality (a random set of flashes) that is completely 
coherent, clear, easy-to-understand, complementarity-free, and paradox-free. Why 
do different experiments correspond to different POVMs? Because they correspond 
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to different choices of the interaction Hamiltonian between the system and the 
apparatus, as well as different choices of 00 



• Since we know that the predictions of GRWf and GRWm are very close to those 
of quantum mechanics for all presently feasible experiments, for these experiments 
the POVM E{-) = E^^'^i-) should be very close to E^"{-), the POVM predicted 
by quantum mechanics. For a principled consideration see Section I6.4[ 

• We called the result of our reasoning the "main theorem about POVMs" in GRWf. 
Let us be explicit about the mathematical theorem that is involved here. It was 
formulated before as Theorem 8 in [15] and asserts the following: 

Let Jif = t^ys®^nv be a separable Hilbert space, G{-) a POVM on a measurable 
space {Q,An) acting on M' , a fixed vector in ^emr with ||0|| = 1, and ( : 
{Q,An) {^,As) a measurable function. For every ip G J^sys with \\ip\\ = 1, let 
"ift = ip ^ 4>j E be a random variable in Q with distribution {'^t\G{-)\'^t) , and 
let Z = C{F). Then there is a POVM E{-) on {^,At) acting on .y^sys so that the 
distribution of Z is {ip\E{-)\ip) . 

The proof of this theorem is a straightforward application of the function property 
(127|) and the reduction property (!28l) of POVMs. What is important for us here is 
to appreciate the relevance of this theorem as the appropriate mathematical for- 
malization in GRWf of the physical statement of the main theorem about POVMs, 
viz. of the statement that with every experiment S , there is associated a POVM 
E{-) such that the probability distribution of the random outcome Z of S , when 
performed on a system with wave function tp, is given by¥{Z E B) = {iIj\E(B)\iI>) . 

• If the wave function (p of the environment were not fixed but random, we would 
still end up with a POVM, as long as cj) is independent of ip (at least conditionally 
on all information available to us about the experimental setup): we would have 
to replace (139|) by 



with /i the distribution of 0. 

• The reader may find it confusing that part of the characterization of the experiment 
was the specification of 0, the wave function of the system's environment: After 
all, it will be practically impossible to repeat the experiment with the same 0, as 
comprises everything outside the system; for example, when we try to repeat the 
experiment at a later time, the moons of Jupiter will have moved, and the state of 
the lab will have changed as it will contain records of the previous experiment. So 
for practical purposes it is important that E{-) as given by fl5^ does not depend 
on all details of 0, but only on a few features of that we can control — and thus 
repeat. Mathematically, however, (!39|) provides the correct POVM, and (!38|) the 

^From the point of view of the entire universe, from which the Hamiltonian may be regarded as 
fixed once and for aU, the relevant choice would lie only in that of </>. 




(40) 
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correct distribution, no matter whether we are able to evaluate or control this 
expression. 



Note that the derivation did not assume any pre-determined time at which the 
experiment is over. It allows that the time at which the outcome Z can be read off 
depends on Z itself, a situation that occurs, e.g., in a time-of-arrival measurement, 
with Z the time when a detector clicks. 

What if factorization \E't = ® is not exactly satisfied, but only approximately? 
Then the probability distribution of the outcome Z is still approximately given by 
('?/' I )!'?/'). More precisely, suppose that, instead of fl36l) . 

= (g) + , (41) 

where ||A^f|| < 1, ||^| = ||0|| = 1, and c = a/1 - ||A^P (which is close to 1). 
c T% 



Then for any B C 



Z ^B) - {i)\E{B)\i)) <'i\^^\. (42) 



This estimate conveys that the relevant measure for quantifying the size of the 
deviation from perfect factorization is the norm of the deviation A\Ef. 

We do not know of a similar derivation of the main theorem about POVMs from 
GRWm, mainly because the probability distribution of the random function m{-,t) 
is not given by a POVM. Nevertheless a derivation from GRWm has been given in 
[7], however one that is rather different in character: It requires great effort and 
yields a limited result, as it assumes a special, idealized type of experiment and, 
since it allows for small errors in the outcome statistics, does not show that the 
outcome statistics is exactly given by a POVM. 



5 The Quantum Formalism 

Before we formulate the GRW formalism, we formulate for comparison the standard 
quantum formalism in the way relevant to us. We begin with the simplified version that 
one learns in beginner's courses and that suffices for many applications. 

The Simplified Quantum Formalism. 

• A system isolated from its environment has at every time t a density matrix pt 
which evolves according to the unitary Schrodinger evolution, 

^ = -msys,Pt]. (43) 

^To see this, write ¥{Z e B) as (*t|G o C"^(B)|*t); insert dUD; use < G o C~HB) < I to bound 
the term quadratic in A^* by ||A5'|p; use the Cauchy-Schwarz inequaUty and |c| < 1 to bound the 
cross terms by 2|jA4'||; use that 1 — |cp = |jA4'|p; in total, by the triangle inequality, obtain the bound 
2||A*||(1+ ||A*||) < 3||A*|| provided ||A*|| < 1/2. 
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With suitable experiments S there is associated a self-adjoint operator A on 
(called the "observable" ) with pure point spectrum; let its spectral decomposition 
be 

A = Y,zP,, (44) 

2 

with Pz the projection to the eigenspace with eigenvalue z. When the experiment 
S is performed on a system with density matrix p, the outcome Z is random with 
probability distribution 

P(Z = ;z) = tr(P^ p) . (45) 
In case Z = the density matrix immediately after the experiment is 



The last rule contains the standard kind of collapse of the wave function, induced 
by "the observer." 

We will need a more general formulation since the above formalism applies only to a 
narrow class of experiments, usually called "ideal measurements." And for this we will 
need some more mathematical notions. 

5.1 Mathematical Tool: Completely Positive Superoperators 

We recall that the trace class TRCL{Jif) is (roughly speaking) the space of all operators 
with finite trace. It contains in particular the density matrices. 

By a superoperator we mean a C-linear mapping ^ : TRCL{Jifi) TRCL{J^2)- 
A superoperator ^ is called completely positive if for every integer k > 1 and every 
positive operator p G C^^'^ T RC L{M[) , {Ik <S) ^)(p) is positive, where Ik denotes the 
identity operator on C'^^*^ [151 [33]. (Completely positive superoperators are also often 
called completely positive maps. If for every density matrix p, tr^(p) < 1 (as will be 
the case for all superoperators that we consider in this paper) then ^ is also called a 
quantum operation [33].) 

Completely positive superoperators arise as a description of how a density matrix 
changes under the collapse caused by an experiment: If p is the density matrix before 
the collapse, then ^(p)/tr^(p) is the density matrix afterwards. The simplest example 
of a completely positive superoperator is 

^ip) = PpP, (47) 

where P is a projection. Note that for a density matrix p, ^(p) is not, in general, a 
density matrix because completely positive superoperators do not, in general, preserve 
the trace. 

In order to prove the complete positivity of a given superoperator, the following 
facts are useful: If p2 is a density matrix on then the mapping ^ : TRCL{Jifi) — 
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TRCL{J^i^ J^2) given by ^(p) = p®P2 is completely positive. Conversely, the partial 
trace p i— *• tr2 p is a completely positive superoperator T RC L{J^i® ,^2) TRCL{Mi). 
For any bounded operator R : J^i P ^ RpR* is a completely positive super- 

operator TRCL{M[) TRCL{J&2), where R* : ,j^2 ^ M is the adjoint of R. The 
composition of completely positive superoperators is completely positive. Positive mul- 
tiples of a completely positive superoperator are completely positive. Finally, when a 
family of completely positive superoperators is summed or integrated over, the result 
is completely positive. Indeed, these rules suffice for all cases we will encounter in this 



For example, the master equation ([T]) of the GRW evolution has the property that 
the solution pt as a function of the initial datum po is given by a completely positive 
superoperator =^[o,t), pt = ^[o,i)Po (indeed, ^[o,t) is trace-preserving). Here is a simple 
path to that conclusion: Since composition preserves complete positivity, it suffices 
to convince oneself that pt+dt as a function of pt is given by a completely positive 
superoperator; using the rules in the previous paragraph (in a non-rigorous way), this is 
the case for both the unitary contribution exp{—iHdt/h)ptexp{iHdt/h) and the non- 
unitary terms in ([T]). 

A canonical form of completely positive superoperators is provided by the theorem of 
Choi and Kraus [T51E3] (also sometimes connected with the name of Stinespring), which 
asserts that for every bounded completely positive superoperator : TRCL{J^i) 
TRCL^Mq) there exist hounded operators Ri : J^i —>■ M2 so that 



where ^ is a finite or countable index set. 
5.2 The Formalism 

We are now prepared for formulating the quantum formalism in greater generality. With- 
out an essential loss of generality, we only consider experiments with discrete value space 
, i.e., experiments for which the set iF of possible outcomes is finite or countable. The 
reason why this is essentially no restriction is that every experiment in practice has lim- 
ited accuracy, and thus indeed only a finite number of possible outcomes. Nevertheless 
it is sometimes convenient to consider a continuous variable z, and indeed, as far as the 
main theorem about POVMs, or fH^ . is concerned, we can allow ^ to be any measur- 
able space (i.e., set with a a-algebra), including the possibility of a continuous variable 
z. However, when trying to formulate the collapse rule flSTl) for a continuous variable 2, 
difficulties arise that lie outside the scope of this paper. 

The Quantum Formalism. 

• A system isolated from its environment has at every time t a density matrix pt 
which evolves according to the unitary Schrodinger evolution P3l) . 



paper. 




(48) 
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With every experiment S with discrete value space beginning at time s and 
ending at time t, there is associated a POVM {E'^'^)^^^ on ^ acting on ^yg. 
When the experiment S is performed on a system with density matrix p^, the 
outcome Z is random with probabihty distribution 

P(Z = z) =tr(p,EQ'^). (49) 

With S is further associated a family (^i^'^j^gj- °f completely positive superop- 
erators acting on TRCL{^sjs) with the consistency property that, for all trace 
class operators p, 

tr{pE^^)=tT^?^{p). (50) 

In case Z = z, the density matrix of the system at time t (immediately after the 
experiment) is 

,^^(P^, (51, 



Since readers may not be familiar with this formulation of the quantum formalism, 
we elucidate it a bit in the following subsections. We begin with a remark. 

The assumption that the experiment is over at a fixed time t is not in all practical 
cases satisfied, for example when the experiment measures the time at which a detector 
clicks. To keep this discussion simple, we postpone the discussion of experiments whose 
duration is random (i.e., decided upon by the experiment itself) to Section [HI 

5.3 First Example 

To begin with, the simplified quantum formalism is contained in the full quantum for- 
malism in the following way: Let ^ be the spectrum of the self-adjoint operator A (a 
finite or countable set since we assume pure point spectrum), E^'^{-) the spectral PVM 
of A, 

= Pz , (52) 

and 

^i^"(p) = PzPPz (53) 

for every operator p in the trace class. Then, the consistency property (!50l) is satisfied 
since 

ti{pE^^) =tr(pP,) =tr(P,pP,) =tr<^Q"(p). 

Eqs. (gnD and §2) reduce to (l45|) and (1461). 

In general, the set ^ need not be a subset of M. For example, an element of ^ — an 
outcome of the experiment — could be a list of numbers (^ C M"), or simply a name 
like "up" or "down". 
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5.4 Consistency Between Superoperators and POVM 

Using the Choi-Kraus theorem 

^,(p) = 5^i?,,pi?:, (54) 

(where we dropped the superscript "Qu" for ease of notation), we can show that the 
POVM E{-) associated with is completely determined by the (^2)26^ according to 

E,= Y,R:,R,i. (55) 

To see this, note that the consistency property fISU]) implies, with flM|) . that 
tr (p E,) = tr %{p) =tiJ2 Rzi P = tr ^ Rl,R,i p . 

i<^J^z i<^.^z 

Specializing to p = \^){^\, we obtain 

idJ'z 

for every if), and therefore (l55l) . Moreover, it follows from fl50l) by summing over all 
z & ^ that Ylzi^s' trace-preserving. 

Conversely, suppose the {^z)z(^S' are given and that the superoperator Yliz<^S''^z 
trace-preserving. Then fl55l) defines a POVM E{-) satisfying fl50|) : RliRzi is a positive 
operator, and E[^) = I because, for every vector ip in Hilbert space, 

(^IE(S')I^) =tr(|^)(^| 5^^i?:,i?2.) = 

To see that fl50l) holds, note that 

tr(pE2) = 5^tr(pi?:,i?2.) = 5^tr(i?2.pi?:J =trK(p). 

5.5 Another Example 

Here is an example illustrating how the POVM E{-) and the superoperators "ioz arise, 
and how to do calculations with them. Suppose we carry out two experiments S'l and 
S2 in a row on the same system with a break of t time units in between, and regard the 
entire procedure as one experiment S' whose outcome Z is given by the pair (Zi, Z2) of 
outcomes of Si and £'2- Suppose we know the POVMs Ei^^i and -£"2,22 (foi' the ease of 
notation, we drop the superscript "Qu") as well as the superoperators ^1^21 and '^2,22; 
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and want to determine the POVM = E(^zi,z2) ^"^^ the superoperators = ^(21,22) 
corresponding to For example, ffi and S2 could be ideal measurements as described 
in the simplified quantum formalism. We will see that in that case S is (in general) not 
itself an ideal measurement, and E{-) a proper POVM (i.e., not a PVM). 

The value space of <f is ^ = ^ x The joint distribution of Zi and Z2, if the 
system starts with density matrix p, is 

p(Zi = Z2 = = nzi = = z2\zi = Zl) = 

[using the consistency property ( j50l) ] 

= tr(e-^^*/"^i,2,(p)e^^*/"E2,.2) = 
[using the Choi-Kraus theorem for ^i^zj 

= tr(e-^*/^5^i?i,,,pi?t.,,e^^*/^E2,.) = 

i 

= tr{pY.RU,e^"'^'E2,.,e-^^^/^R,,,,?l = 

i 

= tr(pE(2i,22)) (56) 

with 

-^(21,22) = ^ -RMi,«^*'^*^'^-^2,22e~*'^*^^-Ri,2i,i • (57) 

i 

Note that this expression defines a POVM, since each summand is a positive operator 
and E{Ti X ^2) = /: 

Zl 22 21 i 22 

= / 

= E E -^i,2i,i-Ri,2i,i = E -^I'^i ^ ■ 

21 i Zl 

In case ffi and (02 are ideal measurements, the formula fl571) reduces to 

i?(2i,22) = Pi,2i e^''*/'^P2,22e-'^*/^PMi . (58) 

If Pi^zi commutes with Q^^t/f^p^ _^^Q-^^t/f>' (equivalently, if the self-adjoint operators Ai 
and ^^Ht/h^^^-iHt/h commute) then E{zi^Z2) is itself a projection, and E{-) is a PVM, 
but in general it is not. 
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The final density matrix after given tliat tlie outcomes were Zx = zi and Z2 = Z2, 

is 

^ tr^2,.2(e-^^*/Vie'^*/'') tr^2,.2(e-^^*/^^i,.,(p)e*^*/'*) ' ^ ^ 

Tliat is, tlie superoperators corresponding to S' are given by tlie composition law 

= ^2...(e-^^*/'^'^i,.,(p)e^^*/^) , (60) 

which is completely positive as a composition of three completely positive superopera- 
tors: ^1,^1, the unitary evolution, and '^2,^2- If '^i ^2 are ideal measurements, so 
that '^i^zi and '^2,22 are of the form (|53ll . then 



^(.„.2)(P) = i^2,.2e-^^*/'^PM,pPM,e^^*/'^P2,.2 , (61) 

which is not itself of the form fl53|) . unless t = and Pi^^-^ commutes with ^2,22- thus 
exemplifies how ^ can be different from flS5]) . 



5.6 The Law of Operators 

How does one know which is the POVM (E'^")^^^ and which the family of 
superoperators associated with S"! In practice, this is part of the working knowledge, 
and it is sometimes obtained by trial and error, or by symmetry arguments, or other 
methods of guessing. It is also often suggested by "quantization rules", but we prefer 
here a rule that is generally valid (and does not appeal to classical physics). 

The Quantum Law of Operators. 

• Suppose we are given the density matrix papp for the ready state of the apparatus, 
its Hamiltonian if app, and the interaction Hamiltonian Hj. Let 

Ut = e"i(^=>'=+^^pp+^^)* (62) 

be the unitary Schrodinger evolution operator for the composite (system + appa- 
ratus). Let the experiment start at time s and be finished at time t, so that the 
result can be read off at t from the apparatus]^ Let PJ'pp be the projection to the 
subspace of apparatus states in which the pointer is pointing to the value z. Then 

= trapp([/sys ® Papp]t/;_J/sys ® P^^'^'Pt-s) (63) 

and 

^.^"(P) = trapp([/sys ® PrPt-slp ® Papp]f/;_J/sys ® Pr]) , (64) 

where trapp denotes the partial trace over the Hilbert space of the apparatus. We 
check the consistency property ( 150|) in Appendix [Bl 

^"^This assumption is to be understood in an operational sense: It is assumed that we humans can 
read off the result when looking at the apparatus. This is different from assuming that the result can 
be read off from the wave function of (the system and) the apparatus, which is notoriously not the 
case, a fact known as the measurement problem of quantum theory. 
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In other words, the superoperator 'if^'^ is obtained by solving the Schrodinger equa- 
tion for the apparatus together with the system, then collapsing the joint density matrix 
as if applying the collapse rule to a "quantum measurement" of the pointer position, 
and then computing the reduced density matrix of the system. 

To obtain that E^^(-) is a POVM, we need that E^ej^^^i*^" is trace -preservmg. 
Indeed, 

tr ^i^''(p) = E ^^{ut-sip ® Papp]f/;_j/sys ® pr?) = 

= tr(t/,_,[p®p,pp]f/;_,[/sy,®5^P;PP]) =tr(f/,_4p®papp]f/;_J =trp, 
provided 

pr = . (65) 

(This equation amounts to the statement that the experiment always has some outcome. 
This is normally not true, as, e.g., the apparatus might get destroyed by some accident 
with small but nonzero probability. However, we may deal with this trivial problem by 
assuming that the set ^ of all possible outcomes contains one element representing the 
possibility that the experiment was not properly carried out.) 

6 The GRW Formalism 
6.1 The Formalism 

The GRW formalism is very similar to the quantum formalism. There are only three dif- 
ferences: (i) the unitary Schrodinger evolution (l43l) between the experiments is replaced 
with the master equation ([1]); (ii) the POVM E'^^^{-) associated with an experiment 
as its "observable" may be different from i?^"(-), and (iii) the superoperators "io^^ 
(encoding the "observer-induced collapse") may be different from Thus, it reads 

as follows. 

The GRW Formalism. 

• A system isolated from its environment has at every time t a density matrix pt 
which evolves according to the master equation ([T]). 

• With every experiment with discrete value space S", beginning at time s and 
ending at time t, there is associated a POVM E'^^ {■) on ^ acting on J^sys- When 
the experiment is performed on a system with density matrix p^, the outcome 
Z is random with probability distribution 

P(Z = z) =tr(p,E,^^W). (66) 
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With S is further associated a family i^f^^^^^ of completely positive superoper- 
ators acting on TRCL{J^sys) with the consistency property that for all trace-class 
operators p, 

tr(pE,«^W) =tr^,G^w^^)_ (g7) 

In case Z = z, the density matrix of the system at time t immediately after the 
experiment is 

o-o'-^S^^ (68) 



For the same reasons as for the quantum formalism, we assume a discrete value space 
In theories (like GRWm and GRWf) with a clear PO, on the other hand, one might 
consider experiments using an "analog" rather than "digital" display, for example ones 
in which the outcome is displayed as the center-of-mass position of a pointer. However, 
even in this case it is reasonable to regard the outcome as discrete, since it is hard to 
regard microscopic details of the pointer's PO as a means to display information about 
the outcome. 

Corresponding to the simplified quantum formalism, one can also formulate a sim- 
plified GRW formalism, which coincides with the simplified quantum formalism: For 
suitable (but not all) experiments S' it so happens that E'^^'^{-) is a PVM (i.e., that 
is a projection for all subsets B C ^), that ^ is a subset of M, and that 
'rf^^lp) = PzpPz for suitable projections P^. In this case, all the data encoding in- 
formation about S' needed for computing outcomes (i.e., iF, i?^^^(-), and {'^^^^)z£s) 
can be encoded into a single self-adjoint operator, A = J^zeS"^^^- 

The GRW Law of Operators. 

• Suppose we are given the density matrix papp for the ready state of the apparatus, 
its Hamiltonian H^pp, and the interaction Hamiltonian Hj, so that H = Hgys + 
-f^app + Hi- Let the experiment ^ start at time s and be finished at time t, and 
let ( : ^[s,t) —>■ 3f he the function that reads off the outcome of from the flashes 
between s and t. Then ii^^^^(-) is given by fl5^ . or, equivalently, 

Ef^'^ = tT.,pp J d/[/3,s®Papp]^[.,,)(/)^M(/), (69) 

and 

^''''^(P) = trapp J rf/L[,,)(/)[p®papp]Lf,,)(/). (70) 



We check the consistency property (|67|) in Appendix [Bl 

Before we begin the derivation of the GRW formalism, we have to elucidate a bit 
more what exactly it asserts. 
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6.2 Isolated System 



The "system" is mathematically represented, as described in Section 12.51 by a splitting 
^ = ® of Hilbert space, as well as a splitting F = Fgys U Fgnv of the flashes, 
grounded in either a set =Sfsys of labels or a region _Rsys C (or both) selecting Fgyg. 
When we say that a system is isolated or (ioes not interact with its environment, we 
mean two things: First, the Hamiltonian does not contain an interaction term, that is, 

®i/env. (71) 

Second, the collapse operators associated with flashes of the system act only on J^sys 
but not on J^env, and vice versa: 

_ l^r'(^) ® ^cnv if i e =^sys and x G i?sys ^^^^ 
1 /sys ® A^'^^(x) otherwise. 

This second condition, apart from expressing that the splitting = J^sys ® is 
compatible with the splitting F = FgysUFgnv, is necessary because otherwise the system 
could, despite the absence of an interaction Hamiltonian, interact through collapses with 
the environment; e.g., an initial product wave function could become entangled. 



6.3 Density Matrix 

Density matrices can arise in two ways: either as representing a statistical mixture (or 
ensemble) of wave functions, or as the reduced density matrix of a system entangled with 
another system (which we will call system b in the following, while system a is the system 
of interest). Both types of density matrices are allowed in the GRW formalism: the 
system under consideration may be entangled with system b (but not to the apparatus 
of the experiment), and the wave function (of the two systems together) may be random. 
It is part of the statement of the GRW formalism that, in this case, (i) the density matrix 
Pt of the system still evolves according to the master equation ([T]) as long as it remains 
isolated (from system b, from the apparatus, and from everything else); (ii) the statistics 
of the outcome Z depends only on the density matrix of the system; (iii) in case Z = z 
the system's reduced density matrix after the experiment is given by fl68l) . 



6.4 Smallness of Deviations 

In this subsection, we characterize the "quantum regime" of the GRW theories, i.e., 
the regime in which the GRW formalism agrees with the quantum formalism. We do 
so in a sketchy way by comparing the laws of operators in the quantum and the GRW 
formalism, (!63|) and (!69|) . which we repeat here for convenience: 

E^'' = tr,pp([/3ys ® Papp]f/;-J/sys ® P^lf/t-.) , (73) 
/?.^^^ = tr,pp J d/ [/sys® Papp] /^m(/)^m(/)- (74) 
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The goal is to describe conditions under which one can see in the above two formulas 
that 

A first condition is that during the experiment, collapses are likely to occur only in 
the apparatus, not in the system: 

= 0) ^ 1 • (76) 

To ensure this condition, the collapse rate per particle A was chosen so small that for a 
microscopic system (e.g., A^'sys = 10"^) the average time between collapses (l/Ai'sygA ~ 10^ 
years) is still much larger than the duration of any practical experiment. 

The next step concerns the calibration function ( that specifies how to read off the 
result Z from the flashes of the apparatus. In the quantum law of operators fl73|) it 
is implicit that the result is read off from the apparatus at the end of the experiment, 
i.e., at time t. To make a comparison possible, we should assume as well that ( in flTll) 
depends only on the flashes near t. So let < e -C t — s, let us divide the flashes during 
[s,t) into the "early flashes" Feariy = F[s,t-e) and the "late flashes" Fiate = -F[i-e,t), and 
let us assume that ( depends only on the late flashes: 

C(F[,,)) = aF[t-e,t)) ■ (77) 

While e must be big enough to allow a macroscopic number of late flashes, we assume 
it is so small that the unitary time evolution can be neglected: 

f/e ~ / . (78) 

We draw some conclusions from the assumptions so far. By the definition ( fTOl) of 

L[s,t){f) = -f'[t-£,t)(/late) -^^[s,t-e) (/early) • (79) 



By dZi, 

i^[t-.,)(/iate) = A#^-/^e-^^^/^ n ^^(^y^" ' (80) 

(x,t',i)e/latc 

where the order of factors in the product is arbitrary since the A are all (in the position 
representation) multiplication operators and thus commute. Let us set 

H^) ■■= j c//late^[t-e,t) (/late) (/late) (81) 

= j rf/iateA#^'-e-^^^ n ^^(^)- (82) 

Given the meaning of P^^"^ and Aj(a;), given that P^^pp also is (at least approximately) 
a multiplication operator, and taking into account fl76|) . it is plausible that 



A(z)^Jsy,®P;PP. (83) 
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We thus obtain that 




Comparison with (173|) shows that the difference to i?^*^ hes in replacing Ut-s by 
L[s,t){f) and integrating over /. By the definition (fTOl) of L[s^t){f), it contains unitary 
operators Ut^^j^-t^ which, were there no collapse operators in between, would combine 
to exactly Ut-s- Thus, f l85p agrees with E'^^ under the second condition that the early 
collapses of the apparatus do not change the statistics of the outcome Z. Put differently, 
the second condition says that the result of the GRW collapse process for the wave 
function \1/ of system and apparatus is more or less the same as if \1/ evolved unitarily 
until t and then GRW collapses took place at t. In most cases, this condition will 
presumably be true because of decoherence, i.e., because wave packets that are disjoint 
in the configuration space of the apparatus and associated with different outcomes z 
will remain disjoint under the unitary evolution, at least until time t. A more detailed 
and careful analysis of the second condition is non-trivial and has been carried out in 
[8] for a concrete example model of an apparatus in the framework of QMUPL [18], a 
spontaneous collapse model very similar to the GRW process. 

7 Derivation of the GRW Formalism 

After some preparatory considerations in Section mi we prove the GRW formalism from 
GRWf in Section \7.2\ followed by an example in Section 17.31 

7.1 Density Matrix 

We need to collect some facts about density matrices in GRWf. 
7.1.1 Statistical Density Matrix 

Set, for ease of notation, = 0. Since the wave function is random, with its 
distribution there is associated the density matrix 



-'^-'^By the way, the expression ([85|) corresponds to the following naive way of computing predictions of 
GRW theories without properly taking into account the role of the primitive ontology: Starting from 
= -0 (g) let ^ evolve according to the GRW process until time t, and if [/gys <S> Pf^Pj^'t « (i.e., 
if Vl/t approximately lies in the subspace to which P^pp projects) then say that the outcome is z. This 
is likely the case for some z because macroscopic superpositions are unlikely to survive for long. 



(85) 




(86) 
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where Ei{Jif) = G Jif : ||\E'|| = 1} is the unit sphere in Hilbert space J^. In other 
words, fl5^ is the density matrix of a large ensemble of systems, each of which started 
with the same initial wave function \l/o but experienced collapses independently of the 
other systems. 

We note that the density matrix pt obeys the master equation ([1]). But the validity 
of ([1]) is even wider: Suppose that even the initial wave function \l/o is random, with 
distribution given by any probabihty measure /xq on E>{J^). Then, for t > 0, is 
doubly random, because of the random initial wave function and of the stochastic GRW 
evolution, with distribution 



Pt{-) = J /io(rf^o)P^o(^tG-)- (87) 
Again, the corresponding density matrix 

Pt = Ej^t){'^t\ = J /it(ci^) |^)(^| (88) 
obeys ([T]). To see this, note that it satisfies 

Pt = J /io(c?^o) J P*o(*i e d$) |$)($| , (89) 

where the inner integral obeys ([1]), so that pt is a mixture of solutions of ([1]) and therefore 
is itself a solution of ([T]). 

Alternatively, pt can directly be expressed in terms of po according to 



Pt = ymPo= J dfL[o,t){f)PoLlo,t){f)- (90) 

From this the master equation ([1]) can be obtained by differentiation with respect to t. 
As a by-product, it can be read off from fp]]) that the mapping ^[o,t) '■ Po ^ Pt obtained 
by evolving the density matrix p according to the master equation ([T]) is a completely 
positive superoperator. 

The following proposition is a consequence of the fact that the distribution of flashes 
is given by a POVM: If the initial wave function \l/o is random with distribution /iq, then 
the distribution of the flashes depends only on the density matrix po associated with pq, 



nFe-) = J po{d^o)no{F e ■) = J poid^o) (^^^(Ol^o) =tr(poG'(-)) 



(91) 



with 

Po 

In other words, if two probability distributions po and po have the same density matrix, 
Po = Po, then they lead to the same distribution of the PO. For comparison, this is not 



[ po(rf*o)|^o)(^o|. (92) 
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true in Bohmian mechanics or GRWm: there, fto and /io may lead to different trajectories 
[ini [23] respectively to different probability distributions of the m function [3]. 

Since /io and fio lead to the same distribution of flashes, we may write Pp^ for that 
distribution. This also means that we can simply talk of the flash process for a given 
initial density matrix, as opposed to the flash process for a given initial wave function. 
As time proceeds, the density matrix determining the distribution of the flashes evolves 
according to the master equation in the sense that 

Fpo(i^[t,oo)G5)=Pp.(i^GS), (93) 

where the right hand side refers to the distribution of the flashes when starting with pt 
at time t. This fact follows from the conditional probability formula by averaging over 

7.1.2 The Marginal Probability Formula 

The marginal probability formula expresses that a system which does not interact with 
its environment is itself governed by GRWf, even if the system is entangled with the 
environment. (Note that this is not true, e.g., in Bohmian mechanics, where the trajec- 
tories of the system's particles depend on the configuration of the environment, even in 
the absence of interaction. As we will see, it is not true in GRWm either.) 
The marginal probability formula says that for an isolated system, 

P*„(F,,sG5)=P,,,,(5). (94) 

Here, P^r^ is the distribution of the flashes in a universe starting with wave function \&o 
at time to = 0, and P^(,(-Fsys G ■) is the marginal distribution of the system's flashes; 
Psys = tfenv |^o)(^o| IS the reduced density matrix of the system; finally, Pp^^^ is the 
distribution of flashes in a universe containing nothing but the system and starting with 
density matrix psys at time in the sense of equation (191]) : 

Pp...(-) = tr(psysGsys(-))- (95) 

We provide a proof of the marginal probability formula in Appendix O 

The marginal probability formula was flrst derived by Bell [9j for the purpose of 
proving a no-signalling theorem for GRWf. To see the connection, suppose the system 
is Alice's lab, which does not interact with Bob's lab (e.g., because they are, when 
considering the relevant time intervals, spacelike separated); then the distribution of 
the flashes in Alice's lab, and thus in particular the distribution of the outcome of any 
experiment, does not depend on external flelds at work in Bob's lab, nor on the common 
wave function ^^o except through the reduced density matrix psys- 

The marginal probability formula should not be confused with the following simple 
consequence of the function property flTTI) : Since Fgyg is a function of F, its distribution 
is given by a POVM E{-), namely 

P^o(^sys eB) = {^o\E{B)\-^o) . (96) 
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The marginal probability formula goes further in two respects: First, its right hand side 
depends only on the reduced density matrix psys, and not on the entire wave function 
^o! second, the POVM Gsys(') is not just some POVM but exactly the one that would 
govern the flashes if the universe contained nothing but the system. 

A related fact is the independence property: If a system does not interact with its 
environment and is initially disentangled from its environment, then the flashes of the 
system and those of the environment are stochastically independent, i.e., their joint 
distribution is a product: 

I''|sys)(gD|env) (-^sys ^ ^sys; ^cnv ^ ^cnv) -Plsys) (-^sys ^ -^sys) -Plcnv) (-^env ^ -^cnv) • ('^'^) 

Moreover, in that case the wave function \E'j remains a product at later times. 

In GRWm there is a formula that is in a way analogous to the marginal probability 
formula of GRWf, as it connects psys to the PO of the system, namely to mgys as intro- 
duced in fl20l) . However, it is much weaker as it connects psys not to the entire future 
history of the PO, for t > 0, but just to the PO at t = 0. This formula reads 

"^sys(a;,t = 0) = rui j dq^ysSiq^ys^i- x) {qsys\psys\qsys) (98) 

assuming, for simplicity, that the system is defined in terms of a label set Jifgys, not 
of a region -Rgys- As before, psys = trenv |^o)(^o|- The formula implies that a different 
wave function 7^ with trenv |^o)(^o| = trenv |^o)(^o| would lead to the same mgys- 
An analogous statement holds in Bohmian mechanics: the marginal distribution of Qsys 
at t = depends only on psyg. Note that in GRWm, rusys cannot be obtained from a 
statistical density matrix. 

Returning to GRWf, we call a system a GRW system if the distribution of the flashes 
of the system (after time 0) is given by psys (at time 0), i.e., if (p4|) holds. The marginal 
probability formula thus asserts that every isolated system is a GRW system — a system 
whose PO behaves as if the system were alone in the universe. Conversely, if a system is 
not isolated then it cannot be expected to be a GRW system since the interaction with 
the environment should affect the pattern of flashes. 

Now that we have the concept of GRW system, one conclusion we can draw is that 
the reasoning of Section H] applies not just to the universe as a whole but also when 
the object and the apparatus together form a GRW system. In this case, the POVM 
^GRW^.'j (depends only on the apparatus. 

A variant of the marginal probability formula asserts the following: If a system is 
isolated during [0, t) then 

n,iFl^;^eB)=¥,^jF[o,t)eB). (99) 

Here, a system can stop being isolated because the Hamiltonian or the collapse rate 
operators are time-dependent, H = Ht and Aj(a;) = Aj t(a;). 

The fact (1991) follows from the first version flMl) of the marginal probability formula: 
Consider a hypothetical universe whose time-dependent Hamiltonian Ht and collapse 
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operators Aj ^(x) are whatever we choose. Then, for a fixed initial wave function \l/o, the 
distribution of fiashes during [0, t) will depend on our choices of Hg for all s e [0, t), but 
not for s > t. So we need not specify Ht before time t. For example, if the system is 
initially isolated, we can turn on the interaction with its environment at time t, and the 
distribution of the fiashes up to time t is the same as it would have been if the system 
were isolated forever, and thus given by 



7.1.3 The Marginal Master Equation 

The marginal master equation 

(Pt)sys = (Psys)t (100) 

expresses the related fact that also the reduced density matrix of the system, when iso- 
lated from but entangled with its environment, evolves according to the master equation 
([1]). This is a general fact about the master equation, which can also be expressed by 
saying that when the system and the environment do not interact , the following diagram 
commutes: 



tCcnv sys 

Po — ' Po 



IDsys (101) 



tfcnv sys 

pt — ' Pt 

Here, ([I])sys means the master equation ([T]) applied to the system. In words, the marginal 
of the master equation is again a version of the master equation: the version that 
would hold if the universe contained nothing but the system. In another notation, 
'^[ot) °trenv = trenv o -5^[o,t) • The marginal master equation allows us to write pf^ instead 
of either (psys)t or (pjsys- 

We provide an analytic-algebraic proof of the marginal master equation in Ap- 
pendix [D1 Alternatively, here are two derivations from the marginal probability for- 
mula: First, since the distribution of the fiashes of the system depends only on (psys)t=05 
and for an isolated system the collapses associated with the fiashes of the environment 
(as well as the Hamiltonian evolution) act trivially on J^ys, (pt)sys depends only on 
(Psys)t=0) aiid since for an empty environment (pt)sys would trivially equal (psys)*, f HOUj) 
must be generally true. Second, the significance of the density matrix associated with 
the system at time t lies in governing the distribution of the fiashes after t. Thus if, 
as the marginal probability formula tells us, the distribution of the system's fiashes af- 
ter t is the same as if the system were alone in the universe and started with (psys)o; 
namely tr((pgys)t Gj'^^^^(-)) , then the system's density matrix at time t must be (psys)t- 
On the other hand, by the marginal probability formula applied to time t, the distribu- 
tion of the system's fiashes after t is tr((pt)sys Gj'j^^)(-)), so the density matrix at time 
t must be (pi)sys- (Mathematically, this argument assumes that the family of operators 
{^^t'oo)(-^) • -^}' -^^ sufficiently rich.) 



38 



7.1.4 State and Primitive Ontology 



From our discussion of density matrices in GRWf we can conclude that the appropriate 
notion of state of a GRW system is that of density matrix. Let us explain. 

While in classical mechanics the notion of state at time t used to mean "phase 
point", i.e., "a mathematical datum that determines the PO after t," the meaning 
has shifted, with the advent of quantum mechanics, to a statistical notion which, in 
our framework, can be characterized as "a mathematical datum that determines the 
probability distribution of the PO after t." For example, in classical mechanics a state 
in the latter sense would correspond to a probability distribution over states in the 
former sense. On the basis of the latter sense, what is the appropriate notion of "state" 
in the various versions of quantum mechanics? 

In Bohmian mechanics, the wave function ipt (or, in fact, the 1-dimensional subspace 
spanned by it) is such a datum, as it determines the distribution (iV'tP) of the config- 
uration Qt and, given Qt, the further motion of the configuration, {Qt')t'>t', thus, ipt 
determines the distribution P^^ of {Qt')t'>t- If V't is random, then its distribution is a 
"state," as the distribution of {Qt')t'>t then is 



However, the statistical density matrix p = f p{d4')\ip) {'ipl associated with p is not 
a "state" since, as mentioned already, some distribution jl ^ p may have the same 
statistical density matrix p = p but lead to a different distribution 7^ P^ of the 
trajectories [101 [23] • (One can postulate a Bohm-type law of motion involving a density 
matrix instead of a wave function, but then yet other trajectories would come out.) Now 
consider a system. If it is disentangled from its environment, i.e., = ipsys ® "^env with 
^ the wave function of the universe, and if the system is isolated from the environment 
(so it stays disentangled) then ipsys is a state for the system. The same is still true if 
the system possesses an effective wave function [24J, i.e., if the wave function \I' of the 
universe is of the form \E' = ipsys ® ipem + and Qt' never enters the support of \E'-'". 
In general, however, the system's wave function, namely its conditional wave function 
UM '^sys('?sys) = {^sys, Qenv) , is uot a statc bccausc the trajectory of Qgys depends 
on more than just ipsys at time t. For the same reason, the reduced density matrix 
Psys = trenv |^)(^| is not a state. 

In GRWm, the situation is the same as in Bohmian mechanics: The wave function 
is a state but the (statistical) density matrix is not [3J; a system that is disentangled 
and isolated possesses a wave function which is a state; the reduced density matrix of 
a system that is entangled with its environment is not a state (in spite of (1^51) . as the 
probability distribution of mgys at later times depends on more than psys at time 0)@ 

^^To see this, consider for example = 0) = 2~^/^(|u)|l) + |(i)|2)), where \u), \d) are orthonormal 
vectors in and |1), |2) in Jf^cm, and suppose that * quickly collapses to either or |(i)|2); 

contrast this with ^[t = 0) = 2-i/2(|;)|l) + |r)|2)), where |0 = 2-^/'^{\u) + \d)) and \r) = 2-^/^{\u)-\d)), 
and suppose that ^ quickly collapses to either |^)|1) or |r)|2). Then psys ~ ^\u){u\ + ^\d){d\ — 




(102) 
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In GRWf, however, the reduced or statistical density matrix is indeed a state for any 
isolated system. This is just a way of re-formulating the marginal probability formula. 

In orthodox quantum mechanics, it is more or less the results of experiments that 
are regarded as the PO, and a state is what determines the distribution of the results of 
experiments. Thus, as in GRWf, the reduced or statistical density matrix p is a state for 
any isolated system, since the distribution of the result of an experiment, acting only on 
the system and associated with POVM acting on e^ys, is given by tr(pi?*^"(-)). 



7.2 Derivation of the GRW Formalism 

The first rule of the GRW formalism asserts in particular that with an isolated system 
at every time t there is associated a density matrix pt. Indeed, there is associated a 
density matrix pt no matter if the system is isolated or not. Let us formulate how that 
density matrix is defined: 

The system, called system a in the following, may be entangled with another system 
called system b. The wave function \Ef of a and b together at time t may be random with 
distribution pt on S(J^ ® pt may be determined by a preparation procedure, by 
previous flashes, or both. Then 

Pt=pt = J /i*(t/vi>)tr,|vi>)(vl>|. (103) 

Now the marginal master equation implies that, as long as system a is isolated, pt 
evolves according to the master equation ([T]). This yields already the first rule of the 
GRW formalism. 

A derivation of the second rule — asserting that the outcome statistics is of the form 
ti{p E{-)) — was given in SectionHl except for the possible entanglement between system 
a and system b. So let us derive the second rule in this more general situation. Suppose 
that 

1. the experiment S' involves a splitting of the world into four parts: system a (the 
"object" of ff), system b, the apparatus of i^, and the rest of the world; 

2. begins at time s and ends at time t l^ 

3. system a, system b, and the apparatus together form a GRW system (i.e., are 
isolated) during the time interval [s, t) and possess a wave function; 

4. at time s, the apparatus is not entangled with system a and b, 

^s=^ab^(l), (104) 



sys, but the mgys associated with \u) or \d) may be completely different from that 
associated with |^) or |r). 

^•^This assumption will be relaxed in Section[8l where we allow that the experiment's run-time is not 
fixed before the experiment. 
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where ipab is the (possibly random) wave function of systems a and b together at 
time s, (j) is the (possibly random) wave function of the apparatus at time s; 

5. ipab and are independent random variables; 

6. during [s,t) the apparatus interacts only with system a, while system a and the 
apparatus do not interact with system 6; 

7. the outcome Z is a function ( of the flashes of the apparatus during [s, t); this 
assumption can be weakened by allowing that Z is read off from the flashes of 
both system a and the apparatus, 

Z = C(F[:,) U F^j;^^) , (105) 

while we need to exclude a direct dependence of Z on the flashes of system b. 

By the conditional probability formula fl33|) . we can regard as the (random) initial 
wave function. By the marginal probability formula (applied to aUapp instead of a) and 
assumption [6] above, the joint distribution of the flashes of system a and the apparatus 
(given "^s) is given by 

H^M U FJPP e A\^s) = tr(pruappG'f:;r(^)) (106) 

with 

P^Uapp = trfe|vi>,)(il/,| = {tlMiiJabl) ® (107) 

and G'l'^^^'^ the history POVM for system a and the apparatus during [s, t). Let E denote 
averaging over the random wave function "^g] using that is stochastically independent 
of ipab, we obtain that 

PaUapp := Ep^^app = tu\i^ab) {i^ab\) ® (E|0)(0|) = Pa ® Papp (108) 

with Pa and papp the density matrices — as defined in fll03p — of system a and the appa- 
ratus, respectively. As a consequence, 

P(Z = z)= EP(Ff:^,) U FJPP G C-'(^)l^s) (109) 
= Etr(pr^,ppGj;^,7P(C-^(^))) (110) 

= tr(p,uappGf:;,7^(C-^(^))) (111) 

= tr(p.Ef^W) (112) 

with £;GRw 

given by (I69p — the GRW law of operators. The operators form a 

POVM because of the function property fl27j) and the reduction property fl28|) . This 
completes the derivation of the second rule. 

Now we turn to the third rule — the collapse rule. According to the definition fllU3l) 
of the density matrix of system a, the density matrix pt = p^ after the experiment is 
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obtained by the partial trace over the environment (here, b and the apparatus) from the 
density matrix associated with the probabihty distribution representing the preparation 
procedure. In this case, the preparation procedure consists of the preparation before the 
experiment ^, the experiment S' itself, and conditioning on the outcome Z = z. Thus, 

p' = pt = p1 = E(trfeuapp|^t)(^t||^ = ^) (113) 

with E the average over both the random wave function \E's of a U 6 U app before S' and 
the flashes during [s, t), conditional on the outcome z of the experiment. This expression 
can be rewritten as 

ltr,pp I dfLy,,)UM®P?'']LU)U) (114) 

which agrees with (ITOil . the GRW law of superoperators. This completes our derivation 
of the GRW formalism. 



7.3 An Example: Consecutive Experiments 

The above derivation is simple but powerful. To illustrate how much so, we now consider 
an example case, extract a prediction from the GRW formalism, and derive the same 
prediction directly from the distribution of the flashes. The example is in fact a problem 
of general interest, namely to compute the joint probability distribution of the outcomes 
of subsequent experiments on the same system ("system a"), say £"2 after Si. Suppose 

51 is carried out during the time interval [si,ti), £"2 during [^2,^2) with < si < ti < 

52 < ^2 < 00, and system a is isolated during [ti, S2). 

To compute such probabilities is exactly the purpose of the third rule, which leads 
to the equation 

, = .,|Z. = .-0 = tr(^?.,4i^^]^.fe)). (115) 

where S^^^ is the superoperator evolving a density matrix according to the master 
equation ([1]) for system a from time ti to time S2- 

To derive (11151) directly from the distribution of the flashes amounts to computing 
the conditional distribution of the positions of the two pointers, each of which consists 
of flashes. Thus, the conditional probability of Z2 = Z2, given that Zi = zi, is of the 
form 

^{C2iF[s,,t,)) = ^2|Cl(i^[.,i,)) = ^1) • (116) 

We will compute this quantity and show that it equals the right hand side of (11151) . But 
before, let us step back and make the setting explicit. We make the same assumptions 
1-[7| as in Section [7.21 for both experiments. Si and S2. At time ti, system a will be 
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entangled with both system h = hi and the apparatus of For the discussion of 

S21 the apparatus of Si will count as part of the system 62; i-e., 62 = &i U app^. We 
explicitly allow that S2 may depend on Zi. For example, experimenters may let the 
decision which "observable" to "measure" depend on Zi] or the time S2 at which <§2 
starts may depend on Zi (say, because the experimenters check the apparatus more 
carefully if Zi was unexpected); or system a may consist of several smaller systems 
(say, 200 electrons), and it may depend on Zi (say, the outcomes of 200 Stern-Gerlach 
experiments) which of these subsystems (^2 will act on (say, only those whose outcome 
was "up"). In particular, assumption [5] includes that, at time S2, the wave function of 
apparatus 2 is conditionally independent of the wave function of a U 61 U app^^, given 
that Zi = zi. 

The preparation procedure of system a for ^2 consists of: the preparation procedure 
for S'l] the time evolution until S2; and conditioning on Zi = Zi. As a consequence, 
by the definition (11031) of the density matrix of system a at time S2, is the conditional 
density matrix given Zi = zi, and it will be useful to first collect some facts about such 
density matrices. 



7.3.1 Conditional Density Matrix 

A frequently relevant special case of the statistical density matrix associated with a 
probability distribution over wave functions corresponds to conditioning on an event 
concerning the flashes between to and t. 

If a GRW world at time to = has random wave function \I'o with probability 
distribution /xo on S(^) then at time t > the conditional density matrix given B C 
n[o,t) is defined to be 

Pi|B = E^o(l^t)(^tl|^[o,t)ei?), (117) 
provided P^Q(F[o,t) G 5) > 0. As a consequence, 

'^[o,t) G b) . (118) 



P,,,JC)=E,,„(^P^,(F[,,oo)GC) 

It is a straightforward observation that the conditional density matrix for fixed B 
evolves with t according to the master equation ([1]): for < s < t < 00, 

Pt\B = y[s,t)Ps\B , (119) 

where <5^[s,t) is again the superoperator that evolves a density matrix from time s to 
time t according to the master equation ([T]). This fact is a simple consequence of the 
definition flll7p in the case that the event B concerns only -F[o,s), rather than all of -F[o,i)- 



^^Although the kind of entanglement that first comes to mind when talking about entanglement 
between system and apparatus (viz., a superposition of macroscopically different states corresponding to 
different outcomes) is essentially absent in a collapse theory (as that was the motivation for introducing 
the spontaneous collapses), a small degree of such entanglement may survive the collapses and, more 
importantly, entanglement between the system and microscopic degrees of freedom of the apparatus 
may persist despite the collapses. 
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Another basic fact (that is easy to prove) is that the conditional density matrix pt\B 
can be expressed in terms of the initial density matrix 

Po=E^„(|^o)(^o|) = j ^lo{d^)\^)m (120) 

s(jr) 

in the following way analogous to ( l90l) : 

Pt\B = ^J dfL[o,t){f)poL*o,t)if) (121) 

B 

with J\f the normalizing constant, 

Af = tT J df L[o,t)if) Po L*o,t)if) (122) 

B 



A further useful fact about conditional density matrices is the following alternative 
variant of the conditional probability formula, which expresses the conditional distribu- 
tion of the future flashes after time s, given that -F[o,s) G B, in terms of the conditional 
density matrix ps\B- for < s < t < oo, 

e C\F[o,s) eB)= P,^,, (C) , (123) 

where P^^ means the distribution of flashes when starting at time with a random wave 
function \E'o with distribution /iq, and the right hand side means the distribution of 
flashes when starting with the density matrix ps\B o-t time s. The above formula (11231) 
follows from the conditional probability formula by taking the appropriate average. 
Indeed, since for any random variable X and any event C, 

P(C|X) = g{X) implies P(C|X e B) = ¥.{g{X)\X e B) , (124) 

the conditional probability formula fl55]) implies 

n,{F^s,t) e c\F[o,s) G 5) = E^,(¥^SF[s,t) e C)\F[o,s) e 5) , (125) 

and the right hand side, after averaging over "^q, equals Fp^^^^C) by virtue of flllSp . 

It is a generalization, but also a special case, of the above alternative conditional 
probability formula (I123P that later flashes, those after t < 00, are governed by the 
time-evolved conditional density matrix pt\B = '^is,t)Ps\B, i-e., 

P.0 (^[*,oo) G C\F[o,s) eB)= P,^, JC) . (126) 
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7.3.2 Conditional Distribution of Pointer Positions 

Let us now turn to evaluating (11161) . By the alternative conditional probability formula 
(11231) . the conditional distribution of the flashes of a U 6i U appj^ U app2 after time ti, 
given Z\ = zi, is the same as if a U 6i U app^^ U appg started at time ti with density 
matrix 

aU6iUappj^Uapp2 
Pti\Zi=zi 

Since from ti onwards, the system a U app2 does not interact any more with bi U app^, 
the marginal probability formula tells us that the distribution of flashes of aUapp2 is the 
same as if a U appg were alone in the world and started at time ti with density matrix 

aUapp2 _ , aUbiUappiUappj /'107^ 

By (19^ . the distribution of the flashes in a U appg after S2 is the same as if a U app2 
were alone in the world and started at time S2 with density matrix 

aUappj _ c;^iUapp2 aUapp2 (TOQ\ 

By the marginal master equation, this equals 

aUapp2 _ ^„ aUbiUappiUapp2 MOQ^ 

Ps.2 — ^IbiUappiPs2|^^=2^ • l-L^JJ 

As a consequence of the assumption that the wave function of apparatus 2 at time S2 is 
conditionally independent of the wave function of a U 6i U app^^, given Zi = zi, we have 
that 

Hence, 



aUbiUappiUapp2 _ aUfeiUapp^ app2 n on^ 

Ps2\Z,=zi -Ps2\Z,=z, ^Ps2\Zi=Zi- U-^UJ 



aUapp2 
rs2 



Kuapp,p:i\tT>p:r- (131) 

Thus, the flashes of a U appj after S2 are distributed as if a started at time S2 with 
density matrix 

ri _ c/>a ^„ aUfciUapPi (^r>n\ 

Ps2 - '^[ti,S2)^^i'iUapPiPti|Zi=2i ■ U-^^J 

Thus, the conditional probability of Z2 = Z2 is given by 

¥{Z2 = Z2\Z, = z,) = tr(^[,"^,,^) [tr6.uapp,p;|5,=ri ^2(^2)) • (133) 

Comparison with (IllSp shows that the joint distribution of Zi and Z2 comes out right 
if 

^l,zi(Psi) _ aUfeiUappi _ 

^^biUappiP(j|2^=^j — 



[using ([m])] 



tr^i,.,(Ps-J 

;^tr,,uapp, j c^/I^[..,*,)(/)p:^^^^p^^[.,,o(/) 



cr'{^i) 

= ;^tr.pp, / ^/C')^(/)K®pr]C')^(/)% 

cr'(^i) 

which is equivalent to the equation (170]) . the GRW law of superoperators. 
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8 Random Run-Time 



We now discuss the case in which duration of the experiment S' is not fixed. Rather, 
we assume that the time at which S' is finished is itself a random quantity T, generated 
by (S itself. The starting time s, in contrast, is assumed to be fixed. We assume that T 
can take values from a finite or countable set ^ C [s, oo) (just as we assumed that the 
value space 2f is discrete). 

8.1 GRW Formalism for Random Run-Time 

The assumption Z = C{F) now gets complemented by the assumption T = t{F), i.e., 
the finishing time T can be read off from the flashes. On top of that, we assume that 
the random variable T = t{F) is a stopping time in the sense of the theory of stochastic 
processes, i.e., that r is such that 

the event t{F) < t depends only on F[s^t) , (134) 

i.e., on the flashes up to time t. In other words, we require that it is possible to read 
off from the flashes up to time t whether the experiment is over yet. In the terminology 
of the theory of stochastic processes, the space fi[s,oo) of all possible flash patterns 
is naturally equipped with a filtration {A[s,t))te{s,oo), where A[s^t) is the cr-algebra of 
all events that depend only on the flashes before t (i.e., it is the collection of those 
B C Q[s,oo) such that for any two /, /' G Jl[s,oo) with f^^^t) = f'[s,t)^ either both f,f'eB 
or both /, f ^ B); our assumption that T is a stopping time means that for each t, the 
event {T < t}, regarded as the set {/ : r(/) < t}, belongs to At- 
Moreover, we assume that 

if t(F) = t then C(-^) depends only on F^g^t)- (135) 

That is, when the experiment is over it must be possible to read off the result from the 
flashes so far. In the terminology of the theory of stochastic processes, this assumption 
is that Z is adapted to the a-algebra A[s,t) of all events that depend only on the flashes 
before T (i.e., it is the collection of those B C such that for any two /, /' G Q[s oo) 

with r(/) = Tin =■■ t and = /f,,,), either both fJ'eB or both fJ'^B). 
In this setting, the formalism reads as follows. 

The GRW Formalism for Random Run-Time. 

• A system isolated from its environment has at every time t a density matrix pt 
which evolves according to the master equation ([T]). 

• With every experiment starting at time s with a discrete set ^ of possible 
outcomes and a discrete set ^ C [s, oo) of possible flnishing times, there is asso- 
ciated a POVM E^^W(-) on ^ X 3^ acting on ^y^. When the experiment ^ is 
performed on a system with density matrix p^, the outcome Z and the time T at 
which S' is flnished are random with joint probability distribution 

P(Z = ^,T = t)=tr(p,EjD. (136) 
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With S" is further associated a family {^^f^)z^s'^t&s of completely positive su- 
peroperators acting on TRCL{J^sys) with the consistency property that for all 
trace-class operators p, 

tr(pi??r) = tr'^i:r(p)- (137) 

In case Z = z and T = t, the density matrix of the system at time t immediately 
after the experiment (f is 



The GRW Law of Operators for Random Run-Time. 

• Suppose we are given the density matrix papp for the ready state of the apparatus, 
its Hamiltonian H^^p, and the interaction Hamiltonian Hj, so that H = Hsys + 
ifapp + Hj. Let the experiment S' start at time s, let ( : fi[s,oo) — > iF be the 
function that reads off the outcome of <f from the flashes, and let r : f2[s,oo) ^ 
be the function that reads off the finishing time of S' from the flashes. Then 

E^f" = trapp([/.ys ® Papp] G{C\z) n r-\t))) (139) 

= trapp j c//[/sys®Papp]^*s,t)(/)^M(/), (140) 

C-i(^)nr-i(i) 

and 

K^'^IP) = trapp I df L[,,){f) [p ® p^pp] Ll,^{f) . (141) 

C-i(^)nr-i{t) 



Concerning fll4Up . note that by assumptions 01341) and 01351) . the set (~^(z)r\T~^{t) C 
^[s,oo) is of the form A x fl[t^oo) for suitable A C Q^g^t)- We wrote C~^{z) fl T~-'^(t) for A in 
the domain of the integral in 014OI) and 01411) : that is, the domain of the integral is to 
be regarded as a subset of ^[s,t), so that the integration variable / is a history of flashes 
in the time interval [s,t) and thus can be inserted into To see that O140p is the 

same as 01391) . note that 

G(r^(^)nr-i(t))=G(Ax%^))=G[,,)(A)= / rf/Lf,,)(/)L[,,,)(/) 

J A 

using fl3Tl) . We check the consistency condition 01371) in Appendix [Bl 
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8.2 Derivation 



The biggest difference to the derivation of the GRW formahsm for fixed run-time is that 
we now have to consider a system that is isolated from its environment only during the 
random time interval [s,T). 

In particular, we need a version of the marginal probability formula for stopping 
times: Consider a system "sys" (such as a U app), let T be a stopping time adapted to 
sys (i.e., a function r of F^^'^ = F^^^^-^ such that the event T = t{F^^^) < t depends only 
on -F[o^^)), and let A^^^j,-^ be the a- algebra of events depending only on the flashes of sys 
up to time T. If the system is isolated during [0, T) then 

P^o(F^^^ G 5) = ¥,^JB) Vfi G A"^^^ . (142) 
Put differently, this means for every t > and every B C i^j^Q 

r{F^y^) =t)= P,^^^ (F[o,) G B\r{F) = t) . (143) 

This fact follows from the marginal probability formula (1941) in much the same way as 
the version ( |99ll for a system that is isolated during the deterministic interval [0,t). 
Consider a hypothetical universe whose time- dependent Hamiltonian Ht and collapse 
operators Ai^t[x) are whatever we choose, and a fixed initial wave function \&0) so that 
the distribution of flashes during [0, t) will not depend on our choices of Hg for s > t. So 
we need not specify before t whether we will turn on the interaction at t or not, and we 
could make this decision depend on the flashes up to time t. Since this choice does not 
affect the distribution of the flashes before t, this distribution is the same as it would 
have been if the system were isolated forever, and thus given by (IMI) . 

Together with the marginal probability formula for stopping times we also obtain a 
marginal master equation for stopping times: If a system is isolated during [0,T) then 
its reduced density matrix will evolve during [0, T) according to the master equation for 
the system. 

With these tools, the derivation of the GRW formalism for random run-time follows 
the same lines as the derivation of the GRW formalism for fixed run-time. 

8.3 Quantum Formalism for Random Run-Time 

The quantum version is exactly the same, except with (i) the master equation ([T]) re- 
placed by the Schrodinger equation, (ii) the POVM E'^^l-) replaced by a different one 
E^'^i^-), and (iii) the superoperators replaced by different ones '^^^^ Here is how 

those operators are determined. 

The Quantum Law of Operators for Random Run-Time. 

• Given the density matrix papp ior the ready state of the apparatus, its Hamiltonian 
Happ, and the interaction Hamiltonian Hj, so that H = Hsjs + Happ + Hj, and Ut = 
exjp{—^Ht) is the unitary Schrodinger evolution operator for system -|- apparatus. 



[f^^;^ g b 
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Let P^j-"^ be the projection to the subspace of apparatus states in which is over, 
the pointer for the outcome is pointing to z, and the pointer for the time when 
was over is pointing to t. Then 

E?:^ = trapp ([/sys ® Papp]t/t. [hys ® ^71^^*-) (144) 

and 

<J"(p) = tr,pp([/3,, ® nT]t/*_.[P ® P.ppPls [Isys ® nT]) . (145) 
We check the consistency condition in Appendix [Bl 

How do we know that (11441) and (11451) are the right formulas? Given their com- 
plexity, and given that the consideration of random run-times is somewhat intricate, we 
need a derivation. But derivation from what? One possibility is to derive them from 
Bohmian mechanics, where the quantum formalism is thought of as a consequence of 
more fundamental postulates. Another possibility is to base a derivation on the fol- 
lowing principle: The predictions should not depend on where to put the "Heisenberg 
cut" between (quantum) system and (classical) apparatus. In particular, the predic- 
tions should be invariant under moving the Heisenberg cut outward, i.e., they should 
not change when we treat the apparatus as a quantum system and assume a "super- 
apparatus" for "measuring" where the pointer of the apparatus points. With the aid 
of this principle one could, for example, derive the quantum formalism from postulates 
that exclusively concern position measurements (applied to the pointer). Similarly, one 
could derive the formalism for random run-time from the formalism for fixed run-time, 
by assuming that all outcomes Z and times T get recorded, while all apparatus is in- 
cluded in the Schrodinger evolution until a very late time, at which all experiments are 
expected to be finished, when we use the super-apparatus to read out the records. The 
details of such a derivation shall not be worked out here. 

9 Genuine Measurements 

Genuine measurements are experiments for determining values of the variables of the 
theory, as opposed to quantum measurements, which do not actually measure anything 
in the ordinary sense of the word, i.e., do not measure any pre-existing property. Genuine 
measurements in GRWm, for example, would be experiments determining m{x,t), or 
the wave function, or some functional thereof. In this section we discuss the possibilities 
and limits of genuine measurements in GRWm and GRWf. We plan to provide a more 
thorough discussion in a future work [S] . 

9.1 Limits on Knowledge 

We show that it is impossible to measure, with microscopic accuracy. 
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(i) the matter density m{x, t) in GRWm 

(ii) the wave function tpt of a system in either GRWm or GRWf. 
Furthermore, we conjecture that it is also impossible to measure 

(iii) the space-time pattern of flashes F in GRWf or of the collapse centers in GRWm 

(iv) the number C[s^t) of collapses in a system during [s,t) in either GRWm or GRWf 

In other words, the exact values of these variables are empirically undecidahle. In 
contrast, it is possible to measure, with certain accuracy and reliability, the macroscopic 
equivalence class of either m{-,t) in GRWm, or of F in GRWf, or of ipt in both GRWm 
and GRWf. Further genuine measurements are possible when information about the 
wave function is provided, as we will explain in Section 19.21 

Let us compare this situation to that of Bohmian mechanics. Also Bohmian mechan- 
ics entails limits on knowledge: for example there is no experiment in a Bohmian world 
that will reveal the velocity of a given particle (unless information about its wave func- 
tion is given) [25l [26]. On the other hand, there is no limitation in Bohmian mechanics 
to measuring the position of a particle, except that doing so will alter the particle's 
wave function, and thus its future trajectory. Here we encounter a basic difference be- 
tween Bohmian mechanics and GRWm: the configuration of the PO can be measured 
in Bohmian mechanics but not in GRWm. (In Bohmian mechanics, the configuration 
of the PO corresponds to the positions of all particles, while in GRWm it corresponds 
to the m{x,t) function for all x G M^.) In GRWf, for comparison, there is nothing like 
a configuration of the PO at time t, of which we could ask whether it can be measured. 
There is only a space-time history of the PO, which we may wish to measure. Bohmian 
mechanics is an example of a world in which the history of a system cannot be mea- 
sured without disturbing its course, and indeed disturbing it all the more drastically the 
more accurately we try to measure it. This suggests that also in GRWf, measuring the 
pattern of flashes may entail disturbing it — and thus finding a pattern of flashes that 
is different from what would have occurred naturally (i.e., without intervention). This 
kind of measurement is not what was intended when wishing to measure the history of 
flashes. 

About (iv), we conjecture further that, unless information about the system's wave 
function is given, no experiment can reveal any information at all about the random 
number C[s,t)- This means the following. Without any experiment we can say that C[s,t) 
has a Poisson distribution with expectation NX{t — s), i.e., 

nCis,t) = n) = ^{NX{t - s)r e-^^(*-^) , (146) 

where NX is the collapse rate of the system (see after (jSj)). The conjecture is that 
no experiment on the system can produce an outcome Z such that the conditional 
distribution F{C[s,t) = would be narrower than (11461) . or indeed in any way different 
from ffT46D . 
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9.2 If Information About the Initial Wave Function Is Given 



Further genuine measurements are possible when information about the wave function 
is provided. What does that mean? For example, while there presumably exists no 
experiment that could measure the number C[s,t) of collapses between s and t on any 
given system with any wave function, there do exist experiments that work for one 
particular wave function ip and can, for a system with initial wave function ipg = ip, 
disclose at least partial information about this number. We describe a concrete example 
experiment which, though it does not reliably determine C[s^t), can estimate C[s,t) better 
than one could without performing any experiment. 

Suppose ip is the wave function of a single electron and a superposition of two wave 
packets, 

?/> = ^|here) + ^Ithere) , (147) 

as may result from a double-slit setup. Suppose, for simplicity, that the Hamiltonian of 
the system vanishes, so that the time evolution is trivial, and that the time span t — s 
is of the order 1/A^A, so that F{Cis,t) = 0) is neither close to 1 nor close to 0. We ask 
whether C[s,t) is zero or nonzero, i.e., whether a collapse has occurred. Without any 
experiment, one can only say that the probability that a collapse has occurred is 



P 



1 - e-^^(*-^) . (148) 



The following experiment provides further information. The task is equivalent to de- 
termining whether ipt = ^|here) + there) or ipt = |here) or ipt = | there), since 
any collapse would effectively reduce (I147P to either |here) or |there). To this end, 
carry out a "quantum measurement of the observable" O given by the projection to 
the 1-dimensional subspace spanned by (I147p . If the result Z was zero, then it can be 
concluded that a collapse has occurred, or C[s,t) > 0. If the result Z was 1, nothing can 
be concluded with certainty (since also |here) and | there) lead to a probability of 1/2 
for the outcome to be 1). However, in this case the (Bayesian) conditional probability 
that a collapse has occurred is less than p (and thus Z is informative about C[s^t))- 

,,i)>0|Z = l)- ^ l^'*^ ' 



p(q,,i) > 0, z = 1) + F{C[s,t) = o,z = i) 

p(z = i|q,,,)>o) p(q,,,)>o) 

F{z = i\C[s,t) > 0) nC[s,t) > 0) + nz = i\C[s,t) = 0) P(q,,t) = 0) 

Ip p 
^ <p. 



\p+l-{l-p) 2-p 

Thus, in every case the experiment can retrodict C[s^t) with greater reliability than it 
could have been predicted a priori. 

This leads us to the question whether it is possible, for a known initial wave function, 
to determine reliably whether a collapse has occurred or not. We conjecture that the 
answer is no, and that indeed no other experiment can retrodict C[s j) for the initial 
wave function (11471) with greater reliability than the quantum measurement of O. 
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We close this subsection with another example of how information about ip can help 
with genuine measurements. Trivially, if we want to measure a system's wave function 
ip then every given information about is of immediate advantage. Concretely, if we 
know that ip is one of the mutually orthogonal vectors 0i, . . . then a "quantum 
measurement of the observable" A = f^\4'k){4>k\ will reveal which one it is. 

9.3 The Quadratic Functional Argument 

We now return to the problem of measuring quantities without knowing the initial 
wave function. There is a simple argument, the quadratic functional argument, that 
will prove the impossibility claims (i) and (ii). This argument was first used, to our 
knowledge, in [22] in the context of Bohmian mechanics, and goes as follows. If the 
same experiment is supposed to measure a quantity Z for every initial wave function ip 
of the system, then the probability distribution of Z must be a quadratic functional of 
ip, i.e., F{Z = z) = {il)\E{z)\i)) for some POVM E(-), since, by the GRW formalism, 
for every experiment the distribution of its results is a quadratic functional of ip. This 
allows us to conclude that a quantity whose distribution is not quadratic in ip cannot 
be measured. 

We list some such quantities: 

• The wave function ip itself, since its distribution as a functional of ■?/' is a 5 distribu- 
tion at %p. (More explicitly, Z = ip would have the distribution P(Z E B) = Isii'), 
which is 1 if tp E B and otherwise, for any subset B G ^ = E>{J^) of the unit 
sphere in Hilbert space. Unlike ip ^ {tp\E{B)\ip), the step function 1b is not a 
quadratic function.) 

• Also the distribution of the wave function ipt at a later time t, arising from the 
initial %p = ips through the GRW evolution, is not quadratic in ip. 

• The distribution of m{x,t) is not quadratic in tps] in fact, for t = s it is a 5 
distribution. More generally, any quantity that is deterministic in ips, i.e., given 
by a functional of tps, has a 6 distribution not quadratic in ipg. 

We have thus proved statements (i) and (ii). Since the distribution of F is in fact a 
quadratic functional of ip, the quadratic functional argument does not yield statement 
(iii). If we could measure wave functions, we would be able to detect collapses by 
measuring the wave function before and after; but we cannot. 

Let us now turn to the heuristic behind the conjecture that flashes cannot be mea- 
sured. Here is a very simple, non-rigorous argument suggesting this. Suppose we had 
an apparatus capable of detecting flashes in a system. Think of the wave function of 
system and apparatus together as a function on conflguration space M^^. There is 
a region in conflguration space containing the conflgurations in which the apparatus 
display reads "no flash detected so far," and another region, disjoint from the flrst, 
containing the conflgurations in which the display reads "one flash detected so far." 
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Recall that a flash in the system leads to a change in the wave function of the form 
ip ^ ip' = Ai{x)'ilj/\\Ai{x)'ip\\, where Ai{x) is a multiphcation operator. But for such a 
change it is impossible to push the wave function from the first region to the second. 

Here is a somewhat similar argument concerning the wish to measure the location 
X of a flash. For simplicity, let us assume the system consists of a single "particle," 
and let us further assume we are given the following information about a system's wave 
function: It is a superposition of finitely many disjoint packets ipi, each so narrow that 
the width is much smaller than a, and any two so very disjoint that the distance is much 
greater than cr. Then a collapse will essentially remove all but one of these packets. Now 
a collapse acting on the system can indeed force the apparatus into a particular state, 
for example if the wave function of system and apparatus together before the collapse 
was 

J2^i^<Pe, (149) 

e 

where may be a state in which the apparatus displays the location of tpe as the loca- 
tion of the flash. The state fll49p may arise from the initial state i/jg) (po by means 
of the interaction between the system and the apparatus. However, in case no flash 
occurs, the reduced density matrix of the system arising from the state fll49p would 
be ^e\4'e){i'e\, which leads to a different distribution of flashes than the pure state 
I ^4'e){^4'e\- This means that the presence of the apparatus has altered the distribu- 
tion of the future flashes. Moreover, the state (11491) represents essentially a quantum 
position measurement, and will collapse most probably because of flashes associated 
with the apparatus, thus forcing the first system flash to occur at the location the was 
the outcome of the position measurement. 

10 Conclusions 

We have formulated a GRW formalism that is analogous to, but not the same as, the 
quantum formalism and summarizes the empirical content of both GRWm and GRWf. 
We have given a derivation of the GRW formalism based on the primitive ontology (PO). 
We have further shown that several quantities that are real in the GRWm or GRWf 
worlds cannot be measured by the inhabitants of these worlds. These were the main 
contributions of this paper. Derivations of the empirical predictions of GRW theories 
have been given before in [281 [9l El [71 [8] , but with two gaps: First, these derivations did 
not pay attention to the role of the PO. Second, these derivations focused on how to 
obtain the quantum probabilities from GRW theories, and thus ignored the (usually tiny) 
differences between the empirical predictions of GRW theories and those of quantum 
mechanics. On the other hand, earlier derivations of empirical deviations from quantum 
mechanics [28[ [39| [36l [6], [311 [1] focused on particular experiments but did not provide a 
general formalism. 

It has played an important role for our analysis that the GRW theories are given 
by explicit equations. Other collapse theories, for example that of Penrose [23 135] , 
are formulated in a more vague way that still permits to arrive at concrete testable 
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predictions deviating from quantum mechanics but does not permit any general theorems 
about arbitrary experiments. The concreteness of the GRW theories also has (what may 
seem like) disadvantages, as it gives the theory a flavor of arbitrariness, and that of a 
being "merely a toy model," as opposed to a serious theory. For example, arbitrariness 
may be seen in the existence of the two parameters A and a (whose values remain 
unknown until experiments confirm deviations from quantum mechanics), or in the 
choice of the Gaussian (jlj) (could it not be another function instead of a Gaussian?), or 
in the assumption that collapses are instantaneous, or in other aspects. But in the end of 
the day it is the concreteness of the GRW theories, or their explicit character, that paves 
the way for their successful analysis. In this paper in particular, theorems are established 
about the GRW theories, and this would not have been possible if the GRW theories 
had not been defined by unambiguous mathematics. Since we are dealing with concrete 
equations, we can derive precisely what predictions these equations entail — with rather 
unexpected results, such as the emergence of a simple operator formalism. 

It has also played an important role to be explicit about the PO, i.e., to say clearly 
what the PO is and to specify an equation governing the PO, namely (fTT!) respectively 
(IT^ . To provide such an equation is somewhat unusual; instead it is often silently 
assumed that when ipt is the wave function of a live cat then there is a live cat. Our 
derivation of the GRW formalism relied on this equation fill I) , which makes the structure 
of the argument explicit and simple. 

Another question arises once the GRW formalism is formulated: Should we not, given 
that the GRW formalism summarizes the empirical contents of GRWm/GRWf, keep only 
the GRW formalism as a physical theory and abandon GRWm and GRWf ? No. From a 
positivistic point of view it would seem so because in this view only empirical predictions 
are regarded as scientific, meaningful statements. But in our view, this position is 
exaggerated. The goal of science is not only to summarize empirical observations but 
also to explore explanations of the observations. It is entirely reasonable to ask for a 
theory that speaks about reality and not about observations, i.e., for a quantum theory 
without observers. 
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A Proof of the Conditional Probability Formula 

Note first that, for t < cxo. 



L[0,t){f) = ^[s,t)(/[s,t)) ^[0,s)(/[0,s)) 



(150) 



and 




(151) 
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the identity operator. As a consequence, 



(^0 


^0,s)if[0,s)) (/s ^[s,t) (/[.,*)) ^[.,t)(/[s,t))^ 


-^[0,s)(/[0,s)) 




(^0 


L[0,s)if[0,s)) L[o^s){f[s,t)) 





J B 



This proves the conditional probabihty formula for t < oo. The one for t = oo follows 
from the one for finite t because in the cr-algebra of fi[s,oo); the family ^snitc of events 
depending only on a finite amount of time form a fl-stable generator, and thus the two 

measures P*o (^-^[s,oo) ^ '[-^[o.s) = /[o,s)) aiid P^i^ (-F[s,oo) ^ ■) coincide since they coincide 

on v4finite- 



B Check of Consistency Conditions 

We provide the proofs of the equations expressing the consistency property between the 
POVM E{-) and the superoperator as defined in the various versions of the law of 
operators. We often use the following mathematical fact: If J^aub = =^ ® Sa is an 
operator on and Taub is an operator on ^aut then 

Sa tTbTaUb = tTb{[Sa ® Ib]TaUb) , (152) 

where tr;, means the partial trace. 

To check the consistency property (15U]) between (1^ and (^^, note that 

tr(pEQ'^) =tr([p® Jenv][/sys®Papp]f/;-,[/sys®P;P1t/t_,) = 

= tr([p ® Papp]t/;_j/sys ® p:nut-s) = 

= tr(f/,_,[p ® Papp]t/;_J/.ys ® P^]) = 
= tr([/,y3 ® ® Papp]f/i-.[/sys ® Pr]) = tl^?\p) , 

where tr always means the trace, sometimes on Jlfsys and sometimes on ® =^nv 
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To check the consistency property fl67|) between fl69|l and flTOjl . note that 



= tr 1 c^/L[,,)(/)[p®p,pp]Lf,,)(/) = tr^,G^W(p). 
To check the consistency condition (11371) between fll40p and (11411) . note that 

C-i{^)nr-i(i) 

= tr 1 rf/ [p ® Papp]^[.,)(/) ^[.,)(/) = tr ^5^W(p) . 

C-iWnr-i(t) 

To check the consistency condition between fll44p and fll45p . note that 
tr(pEQ") = tr([p ® 4pp][/,y3 ® Papp]t/;-. [/sys ® nT]t/<- 

= tr([p®p,pp][/;_, [/3ys®nT]f/t- 

= tr([/,y, ® P;f ][/i-.[p ® Papp]f/;-. [/.ys ® ]) = tr <i"(p) . 

C Proof of the Marginal Probability Formula 

Let us suppose for simphcity that the set Fgys of the flashes of the system corresponds, 
not to a region i?sys in space, but to a subset ^sys of labels. (We come back to the other 
case below.) Thus, the assumption (172!) now reads 

Since the system is not interacting with its environment, as expressed by (171 p . we have 
that Ut = e-^^=^=*/^ ® e-iH.n.t/h ^ jjsys ^ j^env_ follows that 

L{f) = L,ys(/sy,) ® Lenv(/cnv) , (154) 

where /sys (respectively /env) is the set of flashes belonging to the system (respectively 
the environment) and, as the notation suggests. 



^sys 



{(xi,ti,^i),...,(x„,t„,^„)}) = lio<i,<...<t„A'^/V^-^(*"-*»)/^ X 
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with A^'sys = #=Sfsys, and similarly for L^m. As a consequence, 

G[Bsys X -Bonv) = G'sys(-Bsys) ® G'onv(-Senv) , (156) 

where Gsys (respectively Genv) is the POVM that would govern the system (respectively 
the environment) if it were alone in the universe. (In particular, the marginal of Ggys 
for the first n flashes is given by 

G,ys.n{B) = [ df L,y,{fy L,y,{f) V5ci]„, (157) 
Jb 

in parallel to fl30|) .) From (11561) we obtain the marginal probability formula: 

P*o(Fsys e 5sys) = {^o\G{B,y, X fienv)|^o) = 

= (^o|G'sys(5sys) O/envl^o) = tr (psys G'sys(5sys)) 
with Psys = trenv|^o)(*o|- 

The case in which the system is defined in terms of a region R^ys is best treated 
in the framework described in [HI |15] that generalizes the GRW process to arbitrary 
positive operators Aj(x). In that framework, eq.s ffTU]) and ffT^ get modified, and it is 
then easy to arrive at (I154p starting from fl72]) . 



D Proof of the Marginal Master Equation 

We now provide a proof of the fact, described around fllOll) . that for two non- interacting 
but entangled systems a and b, also the reduced density matrix of system a evolves 
according to the appropriate version of the master equation ([1]). This follows from two 
ingredients: first, 

L(/)=L„(/J®L,(/,), (158) 

which is shown as eq. fll54p in the proof of the marginal probability formula in Ap- 
pendix O and second. 



which is the representation (1901) of the density matrix corresponding to the random 
GRW wave function, applied to system a and system b together. Now it follows, using 
f = fa^ fb, that 



p^ = tr,pr'' = tr, j dfaj rf/,[Lfo,)(/„)®Lfo,)(/.)]pr [Lfo,)(/.)* ® Lfo,)(/.) 
= I d/,Lfo,,)(/,)tr,[pr/a® I rf/, (/.)4,,) (/.)*] Lfo,) (/,)* = 
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which means that the reduced density matrix satisfies the appropriate version of the 
master equation ([1]). 
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